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INTRODUCTION. 


This book will describe the recent proof of Fermat’s Last The- 
orem by Andrew Wiles, aided by Richard Taylor, for graduate 
students and faculty with a reasonably broad background in al- 
gebra. It is hard to give precise prerequisites but a first course 
in graduate algebra, covering basic groups, rings, and fields to- 
gether with a passing acquaintance with number rings and va- 
rieties should suffice. Algebraic number theory (or arithmetical 
geometry, as the subject is more commonly called these days) 
has the habit of taking last year’s major result and making it 
background taken for granted in this year’s work. Peeling back 
the layers can lead to a maze of results stretching back over the 
decades. 

I attended Wiles’ three groundbreaking lectures, in June 1993, 
at the Isaac Newton Institute in Cambridge, UK. After return- 
ing to the US, I attempted to give a seminar on the proof 
to interested students and faculty at the University of Illinois, 
Urbana-Champaign. Endeavoring to be complete required sev- 
eral lectures early on regarding the existence of a model over 
Q for the modular curve Xq(N) with good reduction at primes 
not dividing N. This work hinged on earlier work of Zariski 
from the 1950’s. The audience, keen to learn new material, did 
not appreciate lingering over such details and dwindled rapidly 
in numbers. 

Since then, I have taught the proof in two courses at UIUC, 
a two- week summer workshop at UIUC (with the help of Chris 
Skinner of the University of Michigan), and most recently a 



course in spring 2003 at the University of Wisconsin - Madi- 
son. To avoid getting bogged down as in the above seminar, it 
is necessary to assume some background. In these cases, refer- 
ences will be provided so that the interested students can fill 
in details for themselves. The aim of this work is to convey the 
strong and simple line of logic on which the proof rests. It is 
certainly well within the ability of most graduate students to 
appreciate the way the building blocks of the proof go together 
to give the result, even though those blocks may themselves be 
hard to penetrate. If anything, this book should serve as an 
inspiration for students to see why the tools of modern arith- 
metical geometry are valuable and to seek to learn more about 
them. 

An interested reader wanting a simple overview of the proof 
should consult Gouvea [13], Ribet [25], Rubin and Silverberg 
[26], or my article [1]. A much more detailed overview of the 
proof is the one given by Darmon, Diamond, and Taylor [6], and 
the Boston conference volume [5] contains much useful elabora- 
tion on ideas used in the proof. The Seminaire Bourbaki article 
by Oesterle and Serre [22] is also very enlightening. Of course, 
one should not overlook the original proof itself [38], [34] . 
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1 

History and Overview 


It is well-known that there are many solutions in integers to x 2 + 
y 2 = 2 2 , for instance (3, 4, 5), (5, 12, 13). The Babylonians were 
aware of the solution (4961, 6480, 8161) as early as around 1500 
B.C. Around 1637, Pierre de Fermat wrote a note in the margin 
of his copy of Diophantus’ Arithmetica stating that x n +y n = z n 
has no solutions in positive integers if n > 2. We will denote 
this statement for n ( FLT) n . He claimed to have a remarkable 
proof. There is some doubt about this for various reasons. First, 
this remark was published without his consent, in fact by his 
son after his death. Second, in his later correspondence, Fermat 
discusses the cases n = 3, 4 with no reference to this purported 
proof. It seems likely then that this was an off-the-cuff comment 
that Fermat simply omitted to erase. Of course ( FLT) n implies 
(FLT) an , for a any positive integer, and so it suffices to prove 
(FLT ) 4 and ( FLT\ for each prime number t > 2. 
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vi 


First, we must deal with the equation x 2 + y 2 = z 2 . We may 
assume x, y , and £ are positive and relatively prime (since 
otherwise we may divide out any common factors because the 
equation is homogeneous), and we see that one of x or y is even 
(since otherwise z 2 = 2 (mod 4), which is a contradiction). 
Suppose that x is even. Then 

rv) m - (i)' 

with relatively prime factors on the left hand side and a square 
on the right hand side. Hence 

z-y ,2 z + y 2 

with a, b e Z + . Then y = a 2 — b 2 , 2 = a 2 + b 2 , and x = 2 ab. 

[Alternatively, if x 2 + y 2 = z 2 , then (x + iy)/z has norm 1, 
and so by Hilbert’s Theorem 90, 

x + iy a + ib ( a 2 — b 2 )+i 2 ab 

z a — ib a 2 + b 2 

which yields the same result.] 

Theorem 1.1 x A + y A = z 2 has no solutions with x,y,z all 
nonzero, relatively prime integers. 


x 2 = 2 ab 
y 2 = a 2 — b 2 
z = a 2 + b 2 


This implies [FLT) 4. 
Proof: Say 
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with a and b relatively prime. Clearly, b is even (y is odd, since 
x is even) , and from a 2 = y 2 + b 2 we get 

b = 2 cd 
y = c 2 — cl 2 
a = c 2 + d 2 

with c and d relatively prime. Hence 

x 2 = 2 ab = 4cd(c 2 + d 2 ) 

with c, d, and c 2 + d 2 relatively prime. Then 

c = e 2 
d = f 2 

c 2 + d 2 = g 2 

whence e 4 + / 4 = g 2 . 

Note, however, that z > a 2 = ( g 2 ) 2 > g, and so we are done 
by infinite descent (repeated application produces an infinite 
sequence of solutions with ever smaller positive integer z, a 
contradiction). QED 

1.2 Proof of ( FLT) 3 

The first complete proof of this case was given by Karl Gauss. 
Leonhard Euler’s proof from 1753 was quite different and at 
one stage depends on a fact that Euler did not justify (though 
it would have been within his knowledge to do so). We outline 
the proof - details may be found in [16], p. 285, or [23], p. 43. 

Gauss’s proof leads to a strategy that succeeds for certain 
other values of n too. We work in the ring A = Z[(] = {a + b( : 
a, be Z}, where ( is a primitive cube root of unity. The key fact 
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here is that A is a PID and hence a UFD. We also repeatedly 
use the fact that the units of A are precisely ±<Q (i = 0 , 1 , 2 ). 

Theorem 1.2 x 3 + y 3 = uz 3 has no solutions with x,y,z E A, 
u a unit in A, xyz 0. 

This certainly implies (FLT) 3 . 

Proof: By homogeneity, we may assume that x,y, z are rela- 
tively prime. Factoring x 3 + y 3 = uz 3 gives 

(x + y)(x + C y)(x + c 2 y) = UZ 3 , 

where the gcd of any 2 factors on the left divides A := 1 — C- 
If each gcd is 1, then each factor is a cube up to a unit. In 
any case, A is “small” in that |T/(A)| = 3. In particular, each 
element of A is either 0, ±1 mod A. 

Lemma 1.3 There are no solutions when A fxyz. 

Proof: If x = 1 (mod A), say x = 1 + A a, then 

x 3 — 1 = A 3 o(1 + a) (a — ( 2 ) 

= A 3 a(l + a)(a — 1) (mod A 4 ) 

= 0 (mod A 4 ), 

Plugging back in the equation, (±1) + (±1) = ±u (mod A 4 ), 
impossible since none of the 6 units u in A are 0 or ±2 mod 
A 4 ). QED 

Lemma 1.4 Suppose A fxy, X\z. Then X 2 \z. 

Proof: Consider again ±1 ± 1 = uz 3 (mod A 4 ). If the left 
side is 0, then A 4 |z 2 , so A 2 |z. If the left side is ±2, then A|2, 
contradicting |T/(A)| = 3. QED 
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IX 


Lemma 1.5 Suppose that A /| xy, \ k \\z, k > 2. Then there 
exists a solution with A )(xy, \ k ~ 1 \\z. 

Proof: In this case, the gcd of any 2 factors on the left is A. 
Hence we can assume that 

(l)x + y = uicSP 
(2)x + (y = u 2 (3 3 \ 

(3)x + C 2 y = w 3 7 3 A, 

where 17, w 2 , and w 3 are units, t = 3k — 2, and A /|a, /?, 7. 
(1)+C(2)+C 2 (2) yields (setting x\ = /3,yi = 7, and z\ = aA fc_1 ) 

z? + eiVi = e 24 

with ei, 62 units. Reducing mod A 2 , we get 

zhl zh 6i = 0, 

which implies that ei = ±1. Replacing y\ by —y\ if necessary, 
we get 

x\ + y\ = e 2 £ 3 . 

QED 

Finally, to prove the theorem, if A J(xyz, we use lemma 1.2. If 
A /f xy but A | z, we use lemmas 1.3 and 1.4. If X\x then A /f yz, 
hence mod A 3 

0 zb 1 = zhrt 

which implies that u = zhl (mod A 3 ), and hence u = zhl. 
Rearranging yields 

(±zf + (-yf = x 3 , 

a case which has already been treated. QED 
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1.3 Further Efforts at Proof 

Peter Dirichlet and Adrien Legendre proved (FLT) 5 around 
1825, and Gabriel Lame proved (FLT) 7 around 1839. If we set 
( = (£ prime), and 

Z[C] = {ao + aiC + • • • + cii- 2 C 1 2 '■ ai E Z}, 

then there are cases when Z[£] is not a UFD and the factor- 
ization method used above fails. (In fact, Z[£] is a UFD if and 
only if £ < 19.) 

It turns out that the method can be resuscitated under weaker 
conditions. In 1844 Ernst Kummer began studying the ideal 
class group of Q(C), which is a finite group that measures how 
far Z[£] is from being a UFD [33]. Between 1847 and 1853, 
he published some masterful papers, which established almost 
the best possible result along these lines and were only really 
bettered by the recent approach detailed below, which began 
over 100 years later. In these papers, Kummer defined regu- 
lar primes and proved the following theorem, where h( Q(C)) 
denotes the order of the ideal class group. 

Definition 1.6 Call a prime £ regular if £ /\h(Q(()) ( where 
( = e 2m F). Otherwise, £ is called irregular. 

Remark 1.7 The first irregular prime is 37 and there are in- 
finitely many irregular primes. It is not known if there are in- 
finitely many regular primes, but conjecturally this is so. 

Theorem 1.8 (Kummer) (i) (FLT)^ holds if £ is regular. 

(2) £ is regular if and only if £ does not divide the numerator 
of Bi for any even 2 < i < £ — 3. 
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Here B n are the Bernoulli numbers defined by 

-r~r = E(S„/n!)x”. 

For instance, the fact that B i2 = — shows that 691 is 
irregular. We shall see the number 691 appearing in many dif- 
ferent places. 

Here the study of FLT is divided into two cases. The first 
case involves showing that there is no solution with £ /\xyz. 
The idea is to factor x e + y ( = as 

{x + y)(x + (y) ■ ■ ■ [x + ( £ ~ l y) = z\ 

where ( = e The ideals generated by the factors on the 
left side are pairwise relatively prime by the assumption that 
£ /f xyz (since A := 1 — ( has norm £ - compare the proof of 
(FLT) 3), whence each factor generates an fith power in the ideal 
class group of Q(C)- The regularity assumption then shows that 
these factors are principal ideals. We also use that any for unit 
u in Z[C], ( s u is real for some s G Z. See [33] or [16] for more 
details. 

The second case involves showing that there is no solution to 
FLT for t\ xyz. 

In 1823, Sophie Germain found a simple proof that if t is 
a prime with 21+ 1 a prime then the first case of ( FLT 
holds. Arthur Wieferich proved in 1909 that if t is a prime with 
2 £ ~ 1 ^ 1 (mod l 2 ) then the first case of (FLT)? holds. Exam- 
ples of l that fail this are rare - the only known examples are 
1093 and 3511. Moreover, similar criteria are known if p^ 1 ^ 1 
(mod £ 2 ) and p is any prime < 89 [15]. This allows one to prove 
the first case of (FLT)? for many t. 

Before Andrew Wiles, (FLT)? was known for all primes 2 < 
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£ < 4 x 10 6 [3]; the method was to check that the conjecture of 
Vandiver (actually originating with Kummer and a refinement 
of his method) that £ J[h{ Q(C + C -1 )) holds for these primes. 
See [36]. The first case of ( FLT )g was known for all primes 
2 < i < 8.7 x 10 20 . 

1.4 Modern Methods of Proof 

In 1916, Srinivasa Ramanujan proved the following. Let 

OO OO 

A = q n (1 - Q n ) 24 = E r(n)q n . 

n = 1 n = 1 

Then 

r(n) = crii(n) (mod 691), 
where a k (n) = Ed\ n d k . 

A is a modular form; this means that, if we set q = e 27 ™ 2 , A 
satisfies (among other conditions) A (v+iD = (cz d) k A(z) for 
all £ in the upper half-plane Im(z) > 0 and all G T with, 
in this case, (“weight”) k = 12 and V = SL 2 (Z) (in general, we 
define a “level” N by having T defined as the group of matrices 
in SL 2 (Z) such that N\c, here N = 1). For instance, setting 
a,b,d = 1, c = 0, A (z + 1) = A(^), and this is why A can be 
written as a Fourier series in q = e 2mz . 

Due to work of Andre Weil in the 1940’s and John Tate in 
the 1950’s, the study of elliptic curves, that is curves of the 
form y 2 = g(x), where g is a cubic with distinct roots, led to 
the study of Galois representations, i.e. continuous homomor- 
phisms Gal( Q/Q) — > GL 2 (R ), where R is a complete local ring 
such as the finite held F/ or the ring of 1-adic integers Zg. In 
particular, given elliptic curve E dehned over Q (meaning the 
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coefficients of g are in Q), and any rational prime £, there exist 
associated Galois representations p^ iE '■ Gal( Q/Q) -> GL 2 ( Z e ) 
and (by reduction mod £) p iE : Gal (Q/Q ) — > GL 2 (Ff). These 
encode much information about the curve. 

A conjecture of .Jean-Pierre Serre associates to a certain kind 
of modular form / (cuspidal eigenforms) and to a rational prime 
£ a Galois representation, pij. All known congruences for r fol- 
low from a systematic study of the representations associated 
to A. This conjecture was proved by Pierre Deligne [7] (but 
note that he really only wrote the details for A - extensive 
notes of Brian Conrad http://www.math. Isa. umich.edu/~ bd- 
conrad/bc.ps can be used to fill in details here) in 1969 for 
weights k > 2. For k = 2 it follows from earlier work of Martin 
Eichler and Goro Shimura [31]. For k = 1 it was later estab- 
lished by Deligne and Serre [8]. 

These representations pij share many similarities with the 
representations p^ E - Formalizing this, a conjecture of Yutaka 
Taniyama of 1955, later put on a solid footing by Shimura, 
would attach a modular form of this kind to each elliptic curve 
over Q. Thus, we have the following picture 


{Repns from certain modular forms } C 


{Repns from elliptic curves} 

|n 

{Admissible Galois representat 


In 1985, Gerhard Frey presented a link with FLT. If we as- 
sume that a, b, c are positive integers with af + b ( ’ = cf and 
consider the elliptic curve y 2 = x(x — a e )(x + if) (called a Frey 
curve), this curve is unlikely to be modular, in the sense that 



1.4 Modern Methods of Proof xiv 


Pt E turns out to have properties that a representation associ- 
ated to a modular form should not. 

The Shimura-Taniyama conjecture , however, states that any 
given elliptic curve is modular. That is, given E, defined over Q, 
we consider its E-function E(E, s ) = E a n /n s . This conjecture 
states that E a n q n is a modular form. Equivalently, every p^E 
is a pij for some modular form /. 

In 1986, Kenneth Ribet (building on ideas of Barry Mazur) 
showed that these Frey curves are definitely not modular. His 
strategy was to show that if the Frey curve is associated to a 
modular form, then it is associated to one of weight 2 and level 
2. No cuspidal eigenforms of this kind exist, giving the desired 
contradiction. Ribet’s approach (completed by Fred Diamond 
and others) establishes in fact that the weak conjecture below 
implies the strong conjecture (the implication being the so- 
called e-conjecture) . The strong conjecture would imply many 
results - unfortunately, no way of tackling this is known. 

Serre’s weak conjecture [30] says that all Galois representa- 
tions p : Gal( Q/Q) — > GL 2 (k) with k a finite held, and such 
that det(p(r)) = —1, where r denotes a complex conjugation, 
(this condition is the definition of p being odd ) come from mod- 
ular forms. 

Serre ’s strong conjecture [30] states that p comes from a mod- 
ular form of a particular type (&, N, e) with k : N positive inte- 
gers (the weight and level, met earlier) and e : ( Z/NZ) X — > C x 
(the Nebentypus). In the situations above, e is trivial. 

In 1986, Mazur found a way to parameterize certain collec- 
tions of Galois representations by rings. Frey curves are semistable, 
meaning that they have certain mild singularities modulo primes. 
Wiles with Richard Taylor proved in 1994 that every semistable 
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elliptic curve is modular. 

In a picture we have (restricting to certain subsets to be de- 
fined later): 


{Certain semistable elliptic curves} 

|n 

{' Certain modular forms} C { Certain semistable Galois representations 

Wiles’ idea is, first, following Mazur to parametrize the sets 
on the bottom line by local rings T and 7Z. The inclusion trans- 
lates into a surjection from 7 £ — > T. Using some clever com- 
mutative algebra, Wiles obtains conditions for such a map to 
be an isomorphism. Using Galois cohomology and the theory 
of modular curves, it is checked that these conditions generally 
hold. The isomorphism of 7 Z and T translates back into the two 
sets on the bottom line being equal. It then follows that every 
semistable elliptic curve is modular. 

In particular our particular Frey curves are modular, contra- 
dicting the conclusion of Ribet’s work and establishing that 
counterexamples to Fermat’s Last Theorem do not exist. 

The Big Picture. An outline to the strategy of the proof has 
been given. A counterexample to Fermat’s Last Theorem would 
yield an elliptic curve (Frey’s curve) with remarkable proper- 
ties. This curve is shown as follows not to exist. Associated to 
elliptic curves and to certain modular forms are Galois repre- 
sentations. These representations share some features, which 
might be used to define admissible representations. The aim 
is to show that all such admissible representations come from 
modular forms (and so in particular the elliptic curve ones do, 
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implying that Frey’s curves are modular, enough for a contra- 
diction). We shall parametrize special subsets of Galois repre- 
sentations by complete Noetherian local rings and our aim will 
amount to showing that a given map between such rings is an 
isomorphism. This is achieved by some commutative algebra, 
which reduces the problem to computing some invariants, ac- 
complished via Galois cohomology. The first step is to define 
(abstractly) Galois representations. 
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Profinite Groups and Complete Local Rings 


2.1 Profinite Groups 

Definition 2.1 A directed set is a partially ordered set I such 
that for all i,j (E I there is a k G I with i < k and j < k. 

Example: Let G be a group. Index the normal subgroups of 
finite index by I. Say i > j if TV, C Nj. If k corresponds to 
Nk = Ni ft Nj then i,j < k, so we have a directed set. 

Definition 2.2 An inverse system of groups is a collection of 
groups indexed by a directed set I , together with group homo- 
morphisms 7pj : Gi — > Gj whenever i > j. We insist that 
7 in = Id, and that 7Tjk7Tij = 7r^. 

Example: Index the normal subgroups of finite index by I as 
above. Setting Gi = G/Nj , and 7T tJ : Gi — > Gj to be the natural 
quotient map whenever i > j, we get an inverse system of 
groups. 
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We now form a new category, whose objects are pairs ( H . {(f)j : 
i E /}), where H is a group and each fa : H —> Gi is a group 
homomorphism, with the property that 



commutes whenever i > j . Given two elements (H, {fa}) and 
we define a morphism between them to be a group 
homomorphism 9 : H — » J such that 



Gi 


commutes for alH E I. 

Example: Continuing our earlier example, (G, {4>i}) is an ob- 
ject of the new category, where fa : G — > G/Ni is the natural 
quotient map. 

Definition 2.3 lim Gi is the terminal object in the new cat- 
* __ 

egory, called the inverse limit of the Gi. That is , lim Gj is the 
unique object ( X , {xi}) such that given any object (H, { fa }) 
there is a unique morphism 

(H. {&})-(X,{*})- 

The existence of a terminal object in this category will be 
proved below, after the next example. 

Example: Continuing our earlier example, the group above, X, 
is the profinite completion G of G. Since G is terminal, there is a 
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unique group homomorphism G — > G. If this is an isomorphism 
then we say that G is profinite (or complete ). For instance, it 
will be shown below that Gal( Q/Q) is a profinite group. 

We have the following commutative diagram for every object 
(H, {fii}) of the new category and every X* C Nj, 


3 ! 



G contains all relevant information on finite quotients of G. 
G — > G is called the profinite completion of G. If we only use 
those finite quotients which are C-groups, then we obtain the 
pro-C completion of G instead. 

Exercise: Prove that G — » G is an isomorphism, so that G is 
profinite / complete. 

We return to the general case and we now need to prove the 
existence of lim G-, . To do this, let 

c = n g u 

iel 

and 7p : C — » Gi be the ith projection. Let X = {c G C|7r. y - (7r ? :(c)) 
7r j (c) Vi > j}. We claim that 

lim Gi = (X, {7q|x}). 

Proof: (i) (X, {tt 2 ;| x}) is an object in the new category, since X 
is a group (check!) and the following diagram commutes for all 
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i > j (by construction) 


X 



(ii) Given any (H, {4>i}) in the new category, define (j){h ) = 
((j)i(h))i£i, and check that this is a group homomorphism 0 : 
H — > X such that 



Gi 


and that (f) is forced to be the unique such map. QED 

Example: Let G = Z and let us describe G = Z. The finite 
quotients of G are G t = Z/z, and i > j means that j\i. Hence 

Z = {(ai, a2,a%, . . .)| cq G Z/z and a* = aj (mod j) whenever j\i}. 

Then for a G Z, the map an (a, a, a . . .) G Z is a homomor- 
phism of Z into Z. 

Now consider F p = U n F p n. Then, if m\n, 


Gal{ F p /F p ) rcstrictl01 > fea/(F p n/F p ) ^ 


restriction, Y 

Gal(F p m/F p ) = 


Z/n 
Z jm 


Note that Ga/(F p n/F p ) is generated by the Frobenius auto- 
morphism Fr : x i— > x p . We see that (Gal(F p /F p ), {0 n }) is an 
object in the new category corresponding to the inverse system. 
Thus there is a map Gal( F p /F p ) —> Z. 

We claim that this map is an isomorphism, so that Gal( F p /F p ) 
is profinite. This follows from our next result. 
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Theorem 2.4 Let L/K be a ( possibly infinite ) separable, al- 
gebraic Galois extension. Then Gal(L/K ) = Inn Gal(Lj/K), 
where the limit runs over all finite Galois subextensions Li/K. 

Proof: We have restriction maps: 

Gal(L/K) Gal (Li/K) 



Gal(Lj/K) 

whenever Lj C Lj, i.e. i > j. We use the projection maps to 
form an inverse system, so, as before, (Gal(L/K ), {0}) is an 
object of the new category and we get a group homomorphism 

Gal(L/K)-^> lim GcifiLfiK). 

We claim that f is an isomorphism. 

(i) Suppose 1 g G Gal(L/K). Then there is some x G L 

such that g(x) x. Let Lj be the Galois (normal) closure of 

K(x). This is a finite Galois extension of K, and 1 g\i t = 
<t>i(g), which yields that 1 fi(g)- Hence <f is injective. 

(ii) Take (gf) G lim Gal(Lj/ K) - this means that Lj C Lj => 
g t | L j = gj. Then define g G Gal (L/K) by g(x) = gfix) when- 
ever x G Lj. This is a well-dehned held automorphism and 
4>(g) = (gf). Thus f is surjective. QED 

For the rest of this section, we assume that the groups Gj are 
all finite (as, for example, in our running example). Endow the 
finite Gj in our inverse system with the discrete topology. Gj 
is certainly a totally disconnected Hausdorff space. Since these 
properties are preserved under taking products and subspaces, 
lim Gj C Y[Gj is Hausdorff and totally disconnected as well. 
Furthermore n Gj is compact by Tychonoff ’s theorem. 
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Exercise: If /, g : A — » B are continuous (A, B topological 
spaces) with A, B Hausdorff, then {x\ f{x) = g(x)} is closed. 
Deduce that 

lim Gi = HfcellGh 7Tp(7r*(c)) = 7Tj(c)} 

i>j 

is closed in n Gi, therefore is compact. In summary, lim Gj is a 
compact, Hausdorff, totally disconnected topological space. 

Exercise: The natural inclusion Z — » Z maps Z onto a dense 
subgroup. In fact, for any group G , its image in G is dense, but 
the kernel of G — > G need not be trivial. This happens if and 
only if G is residually finite (meaning that the intersection of 
all its subgroups of finite index is trivial). 

If we denote by Fr the element of Gal( F q /F q ) given by Fr(x) = 
x q , i.e. the Frobenius automorphism, then Fr does not gener- 
ate the Galois group, but the group which it does generate is 
dense (by the last exercise), and so we say that Gal(F q /F q ) 
is topologically finitely generated by one element Fr (and so is 
procyclic). 

2.2 Complete Local Rings 

We now carry out the same procedure with rings rather than 
groups and so define certain completions of them. Let R be a 
commutative ring with identity 1, I any ideal of R. For i > j 
we have a natural quotient map 

R/r^UR/P. 

These rings and maps form an inverse system (now of rings). 
Proceeding as in the previous section, we can form a new cate- 
gory. Then the same proof gives that there is a unique terminal 
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object, Rj = lim R/P , which is now a ring, together with a 
unique ring homomorphism R — > i?/, such that the following 
diagram commutes: 



Note that Rj depends on the ideal chosen. It is called the 
I-adic completion of R (do not confuse it with the localization 
of R at I). We call R (/-adically) complete if the map R — ■» Rj 
is an isomorphism. Then Rj is complete. If / is a maximal ideal 
m, then we check that Rj is local , i.e. has a unique maximal 
ideal, namely m R. This will be proven for the most important 
example, Z p (see below), at the start of the next chapter. 

Example: If R ! = Z, m = pZ. p prime, then Rj = Z p , the p-adic 
integers. The additive group of this ring is exactly the pro-p 
completion of Z as a group. In fact, 

Z p = {(aq, a 2 , . . .) : a % G Z /p\ a % = aj (mod p 1 ) if i > j}. 

Note that Z will always be used to mean the prohnite (rather 
than any i-adic) completion of Z. 

Exercise: Let R = Z and / = 6Z. Show that the /-adic com- 
pletion of Z is isomorphic to Z 2 x Z3 and so is not local (it’s 
called semilocal). 

Exercise: Show that Z = Lb Z p , the product being over all 
rational primes. 

Exercise: Show that the ideals of Z p are precisely {0} and p l Z( 
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(i > 0). (This also follows from the theory developed in chapter 
3.) 

We will be interested in the category C whose objects are the 
complete local Noetherian rings with a given finite residue field 
(that is, the ring modulo its maximal ideal) k. In this category a 
morphism is required to make the following diagram commute: 



where the vertical maps are the natural projections. This is 
equivalent to requiring that </>( m#) C m^, where m# (respec- 
tively my) is the maximal ideal of R (respectively S). 

As an example, if k = F^>, then is an object of C. By a 
theorem of Cohen [2], the objects of C are of the form 

W(k)[[Ti, . . . ,T m ]]/ (ideal), 

where W{k ) is a ring called the ring of infinite Witt vectors 
over k (see chapter 3 for an explicit description of it). Thus 
W(k) is the initial object of C. In the case of F p , W(F P ) = Z p . 

Exercise: If R is a ring that is /-adically complete, then GL n (R ) = 
lim . GL n (R/P ) (the maps GL n [R/I l ) — > GL n (R/I j ) being the 
natural ones). 

Note that the topology on R induces the product topology on 
M n (R ) and thence the subspace topology on GL n (R). 

The Big Picture. We shall seek to use continuous group 
homomorphisms ( Galois representations) 


Gal{ Q/Q) -► GL n (R), 
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where R is in some C, to parametrize the homomorphisms that 
elliptic curves and modular forms naturally produce. In this 
chapter we have constructed these groups and rings and ex- 
plained their topologies. Next, we study the internal structure 
of both sides, notably certain important subgroups of the left 
side. This will give us the means to characterize Galois repre- 
sentations in terms of their effect on these subgroups. 
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We begin with a short investigation of Z p . A good reference for 
this chapter is [28]. 

We first check that p n Z p is the kernel of the map Z p — > Z /p n . 
Hence, we have 

2 p O pZ p D p Z p . . . 

If x E p n Z p — p n+1 Z p , then we say that the valuation of x, 
v(x) = n. Set v(0) = oo. 

Exercise: x is a unit in Z p if and only if v(x) = 0 

Corollary 3.1 Every x G Z p — {0} can be uniquely written as 
p v ^u where u is a unit. 

In fact 

(*) : (1 )v(xy) = v(x) + v(y), (2)v(x + y) > min(n(x), v(y)). 

We define a metric on Z p as follows: set d{x, y) = (E^-v) for 
a fixed 0 < c < l,i ^ j/ G Z p (d(x,x) = 0). We have some- 
thing stronger than the triangle inequality, namely d(x, z) < 
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ma x(d(x,y),d(y,z)) for all x,y : z G Z p . This has unusual con- 
sequences such as that every triangle is isosceles and every point 
in an open unit disc is its center. 

The metric and prohnite topologies then agree, since p n Z p is a 
base of open neighborhoods of 0 characterized by the property 
v(x) > n <==> d( 0, x) < c n . 

By (*)(1), Z p is an integral domian. Its quotient field is called 
Qp, the field of p-adic numbers. We have the following diagram 
of inclusions. 


Q c — "Q 

A 

Q c — 


z c 


•p 


This then produces restriction maps 
*) Gal(Q p /Q p ) — > Gal( Q/Q). 


We can check that this is a continuous group homomorphism 
(defined up to conjugation only). Denote Gal(K/K ) by Gk- 

Definition 3.2 Given a continuous group homomorphism p : 
G q — > GL n (R), (*) yields by composition a continuous group 
homomorphism p p : G q p — > GL n (R). The collection of homo- 
morphisms p p , one for each rational prime p, is called the local 
data attached to p. 

The point is that G q p is much better understood than G q; 
in fact even presentations of G q p are known, at least for p 2 
[18]. We next need some of the structure of Gq p and obtain 
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this by investigating finite Galois extensions K of Q p and how 
Gal(K/Q p ) acts. 

Now, 

Z p — {0} = {p n u\n > 0, u is a unit}, 


whence 

Q p — {0} = {p n u\u is a unit inZ p }. 

We can thus extend v : Z p — {0} — > N to a map v : Q p — > Z 
which is a homomorphism of groups. 


Definition 3.3 i discrete valuation w on a field K is a sur- 
jective homomorphism w : K x — » Z such that 


w(x + y) > min (w(x),w(y) 
for all x,y E K (we take w( 0) = 00 / 


Example: The map v : Q p — > Z is a discrete valuation, since 
(*) extends to Q p . 

Exercise: If K has a discrete valuation then 

A = {x e K\w(x) > 0} 

is a ring, and 

m = {x G K\w(x) > 0} 

is its unique maximal ideal. Choose 7 r so that w (n) = 1. Every 
element x E K x can be uniquely written as x = 7 t w ^u, where 
u is a unit in A. 

Remark: A : A/m, and 7r are called respectively the valua- 
tion ring, the residue field, and a uniformizer of w. As with 
the valuation on Q p , w yields a metric on K. An alterna- 
tive way of describing the elements of A is by the power se- 
ries {cq + Ci7r + C27T 2 + . . . |cj E S} , where S C A is chosen 



3. Infinite Galois Groups: Internal Structure xxix 


to contain exactly one element of each coset of A/m. The 
connection with our approach is by mapping the typical el- 
ement to (co (mod 7r),co + cpix (mod 7 t) 2 ,co + c\n + 

(mod 7r ) 3 , . . .) G Y[A/m l = A. The description extends to K 
by having K = {E£2/v c dL}, Le. Laurent series in 7r, coefficients 
in S. 


Corollary 3.4 The ideal m is equal to the principal ideal fn), 
and all ideals of A are of the form fx n ), so A is a PID. 

Let K/Qp be a finite Galois extension, and define a norm 
N : K x — > Q* by 

x t— > n a ( x ) 

ctGG 

where G = Gal(K/Q p ). The composition of homomorphisms 

K* XU -J4 Z 


is nonzero with some image / Z. We then define w : K x — > Z 
by w = jv o N. Then re is a discrete valuation on K. f is 
called the residue degree of K , and we say that w extends v 
with ramification index e if w |q = ev. For any x E Q p , we 
have 


1 l fi 

ev(x) = w(x) = —v(N{x)) = — v(x n ) = -jv(x), 


so that ef = n = [K : Q p . 


Proposition 3.5 The discrete valuation w is the unique dis- 
crete valuation on K which extends v. 


Proof: A generalization of the proof that any two norms on a 
finite dimensional vector space over C are equivalent ([4], [29] 
Chap. II). QED 
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Exercise: Let A be the valuation ring of K , and m the maximal 
ideal of A. Prove that the order of A/m is pf 
We have a collection of embeddings as follows: 



We note that the action of G on K satisfies w(<r(x)) = w(x), 
for all a £ G, x G K , since w and w o o both extend v (or by 
using the explicit definition of w). 

This property is crucial in establishing certain useful sub- 
groups of G. 

Definition 3.6 The ith ramification subgroup ofG = Gal(K/Q p ) 
is 

Gi = {a E G\w(a(x) — x) > i + 1 for all x £ A}, 
for i = — 1, 0, . . .. (See [29], Ch. IV.) 

Gi is a group, since 1 6 G*, and if <r, r E G, then 

w(<tt(x) — x) = w{ct{t{x ) — x) + (cr(x) — x)) 

> min(u;(cr(r(.T) — x),w(a(x) — x)) 

T i T 1, 

so that ar £ Gi. This is sufficient since G is finite. Moreover, 

— x = a(ra^ 1 (x) — u _1 (x)) 

= u(r(cr~ 1 (x)) — u _1 (x)) 

shows that Gi < G. 

We have that 

G = GU D G 0 D Gi D • • • , 
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where we call Go the inertia subgroup of G, and Gi the wild 
inertia subgroup of G. 

Exercise: Let 7r be a uniformizer of K. Show that in fact 

Gi = {<t G G|ie(cr(7r) — 7i) > i + 1}. 

These normal subgroups determine a filtration of G, and we 
now study the factor groups in this filtration. 

Theorem 3.7 (a) The quotient G/Gq is canonically isomor- 
phic to Gal{k/ F p ), with k = A/m., hence it is cyclic of order 

/• 

(b) Let Uo be the group of units of A. Then Ui = 1 + {Tj 
(i > 1) is a subgroup ofUo. For all a G G, the map a i— > (t(h)/ii 
induces an injective group homomorphism Gi/Gi + \ Ui/Ui + \. 

Proof: (a) Let cr G G. Then a acts on A, and sends m to 
m. Hence it acts on A/m = k. This defines a map (f) : G — > 
Gal(k/F p ) by sending a to the map x + m i— > cr(x) + m. We 
now examine the kernel of this map. 

ker f = {cr e G\a{x) — x G m for all x G ^4} 

= {a G G\w(cr(x) — x) > 1 for all x G A} = Gq. 


This shows that f induces an injective homomorphism from 
G/G 0 to Gal(k/F p ). 

As for surjectivity, choose a G A such that the image a of a 
in k has k = F p (a). Let 

P(x) = II ( x ~ 

creG 

Then p(x) is a rnonic polynomial with coefficients in A, and 



3. Infinite Galois Groups: Internal Structure xxxii 


one root is a. Then 

p{x) = ( x ~ a ( a )) £ k[x\ 
a&G 

yields that all conjugates of a are of the form a (a). For r G 
Gal(k/F p ), r(a) is such a conjugate, whence it is equal to some 
a (a). Then the image of a is r. 

(b) <7 G Gi <G=> w(a(n) — n) > i + 1 -<=> a(n)/n G [/*. 

This map is independent of choice of uniformizer. Suppose 
7t' = 7TU is another uniformizer, where u is a unit. Then 

<j( 7r') (j{Tl)(j{u) 

7r' 7 T U 

but for <r G Gi , we have that i + 1 < w(<r(u) — u) = w(a(u)/u — 
1) + u?(w), so that (j{u)/u G Ui + 1 , i.e. a(7r / )/7r / differs from 
by an element of C 4 +- 1 - 
The map dj is a homomorphism, since 

crr(7r) <j(7t) r(7r) a(u) 

7T 7T 7T U 

where u = t(7t)/tt, and, as above, cr(u)/u G U l+ \. 

Finally the map 0j is injective. To see this assume that cr(7r)/7T G 
Ui+ 1 - Then cr(7r) = 7r(l + y ) with y G (7r t+1 )- Hence rr ( tt) — n = 
Try has valuation at least f + 2, that is, a G G* + i. QED 

Proposition 3.8 We have that Uq/Ui is canonically isomor- 
phic to k x , and is thus cyclic of order pf — 1, and for i > 1, 
Ui/Ui + 1 embeds canonically into n l /n l+l = A: + ; and so is ele- 
mentary p-abelian. 

Proof: The map x i— > x (mod m) takes Uq to k x , and is 
surjective. Its kernel is {x\x = 1 (mod m)} = Ui, whence 
HoM = /c x . 
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The map 1 + x i— > x (mod n} l+1 takes U{ to ir l /ir l+1 and 
is surjective with kernel U l+ \. Moreover, since n acts trivially 
on TT l /ir l+1 , this is an (M/m)-module (i.e. A;- vector space). Its 
dimension is 1, since otherwise there would be an ideal of A 
strictly between tt* and ir l+1 . QED 
We note in particular that G is solvable, since the factors in its 
filtration are all abelian by the above. Thus, G q p is prosolvable. 
Specifically, we have the following inclusions: 


G 

cyclic | 



I 


cyclic order prime to p 

Gi 

p— group! 




{i 


i 


Corollary 3.9 The group G = Gal(K/Q p ) is solvable. More- 
over, its inertia subgroup Gq has a normal Sylow p-subgroup 
(namely G\) with cyclic quotient. 


To study continuous homomorphisms p p : Gq p — > GL, 2 (R), 
we are looking at finite quotients Gal(K/Q p ) of G q . If we can 
next dehne ramification subgroups for the infinite Galois group 
G q , then we can describe properties of p p in terms of its effect 
on these subgroups. 
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3.1 Infinite extensions 

Let Q p C M C L be finite Galois extensions, with respective 
valuation rings Am , A l- We have a restriction map 

Gal[Lj Q p ) — » Gal(M/Q p ) 

Ul Ul 


where the horizontal map is restriction. In general the image of 
G\ L ^ under this restriction map does not equal g[ M ^ ; in order to 
have this happen, we need the upper numbering of ramification 
groups (see Chapter IV of [29]). 

It is, however, true that the image of G\ L ' > equals g[ M ^ for 
i = 0,1. This follows for i = 0 since o £ G ^ <=> the 

valuation of cr (x) — x > 0 for all x £ A L . It follows that the 
valuation of &\m(x) — x > 0 for all x £ Am- The case i = 1 
now follows since the Sylow p-subgroups of the image are the 
image of the Sylow p-subgroups. 

Hence, we obtain, for i = 0 or 1, an inverse system consisting 
of the groups g\ L ^ as L runs through all finite Galois extensions 
of Q p , together with homomorphisms G ^ g\ M ^ whenever 

MCI. Inside UGal(L/Q p ), by definition of inverse limit, we 
have 


G q p = lim Gal(L/Q p ) 

Ul 

G 0 = Imi G;/ 1 

Ul 
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Gi = lim G[ L) 

A second inverse system consists of the groups Gal(L/Q p ) / Gq ^ 
Homomorphisms for M C L are obtained from the restriction 
map and the fact that the image under restriction of G ^ is 
equal to Gq M \ 

Gal(L/Q p ) /G { 0 L) — * Gal(M/ Q„) /G { 0 M) 


Gal{k L /Y p ) ^ Gal(ku/F p ) 

We use this to show that 

G Qp /G 0 * Gal(F p /F p ) 2* Z = Y[Z q . 

q 

We shall see later that 

Go/Gi = n Z 

qGv 

The following will come in useful: 

Theorem 3.10 If H is a closed subgroup of Gq p , then H = 
Gal(Q p /L), where L is the fixed field of H . 

Proof: Let H be a closed subgroup of G q p , and let L be its fixed 
held. Then for any finite Galois extension M/L, we obtain a 
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commutative diagram 



Since Gal(Q p /L ) = lim Gal(M/L), this implies that H is dense 
in Gal(Q p /L). Since H is closed, H = Gal(Q p /L). QED 

In fact infinite Galois theory provides an order-reversing In- 
jection between intermediate fields and the closed subgroups of 
the Galois group. 

3.2 Structure of Gq p /Gq and Gq p /G\ 

We now have normal subgroups G\ C Go C G q . Suppose that 
p : G q — > GL, 2 (R ) is one of the naturally occurring continu- 
ous homomorphisms in which we are interested, e.g. associated 
to an elliptic curve or modular form. We get continuous ho- 
momorphisms p p : Gq p — > GL- 2 {R) , such that p p {G o) = {1} 
for all but finitely many p (in which case we say that p is un- 
ramified at p). For a p at which p is unramified, p induces a 
homomorphism Gq p /Go — > GL/ 2 (R). Often it will be the case 
that p p (G i) = {1} for a ramified p (in which case we say that p 
is tamely ramified atp ). In this case p induces a homomorphism 
Gq p /Gi — > GL, 2 (R). Thus, we are interested in the structure of 
the two groups Gq p /Gq and Gq p /G\. 

Here is a rough outline of how we establish this. In the fi- 
nite extension case considered already, G/Gq is isomorphic to 
Gal(k/F p ) (and so is cyclic), and Gq/G\ embeds in k x (and so is 
cyclic of order prime to p). In the limit, Gq p /Go = lim G / Gq = 
lim Gal(k/F p ) = Gal( F p /F p ) (and so is procyclic), and Gq/G\ 
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embeds in lim k x . A few things need to be checked along the 
way. The first statement amounts to there being one and only 
one homomorphism onto each Gal(k/F p ). For the second state- 
ment, we check that the restriction maps between the Gq/Gi 
translate into the norm map between the k x . This gives an in- 
jective map Gq/Gi — » lim k x , shown surjective by exhibiting 
explicit extensions of Q p . 

As for Gq p /G i, consider the extension of groups at the finite 
level: 

1 Go/Gi G/Gi G/G 0 1. 

Thus, G/G\ is metacyclic , i.e. has a cyclic normal subgroup 
with cyclic quotient. The group GjG\ acts by conjugation on 
the normal subgroup Gq/G\. Since Gq/Gi is abelian, it acts 
trivially on itself, and thus the conjugation action factors through 
G/Gq. So G/Gq acts on Gq/G\, but since it is canonically iso- 
morphic to Gal(k/Fp), it also acts on k x . These actions com- 
mute with the map Gq/G\ —> k x . In the limit, Gq p /G\ is 
prometacyclic, and the extension of groups 

1 - Gq/Gi - G Clp /G l - G Qp /G 0 - 1 

is a semidirect product since Gq p /Gq is free, with the action 
given by that of Gal(F p /F p ) on lim k x . In particular, the Frobe- 
nius map is a (topological) generator of Gal( F p /F p ) and acts 
by mapping elements to their pth power. 

Theorem 3.11 Let Go, G\ denote the Oth and 1st ramification 
subgroups of Gq p . Then Gq p /Gq = lim Gal(k/F p ) = Z and 
Gq/Gi — liin k x = Y\ q ^ p Z q , where the maps in the inverse 
system are norm maps. Moreover, Gq p /G\ is (topologically) 
generated by two elements x,y where y generates Gq/G\, x 
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maps onto the Frobenius element in GqJGq = Gal(F p /F p ) , 
and x~ l yx = y p . 

For any finite extension L/Q p , define G (L) = Gal{Lf Q p ), A L 
is the valuation ring of L , and k,L the residue field of L. 

Let M C L be finite Galois extensions of Q p . We obtain the 
following diagram: 



Note that Go lis in the kernel of each </>£. From this diagram we 
get maps 4>l '■ GqJGq — > Galfik^l F p ). We note the following 
facts. 

1) Given k , there is only one such map, </>&. The reason here 

is that if ki, = = k, then there is a unique map 

Gal{LM/Q p ) Gal(k LM /F p ) Ga^(A:/F p ) 

since Gal{k,LM /F p ) is cyclic. 

Definition 3.12 The valuation ring of the fixed field of fik wdl 
be denoted W{k), the ring of infinite Witt vectors of k. An al- 
ternative explicit description is given in [17]. Note that W( F p ) 
is simply Z p . 

2) Given k , there is some L with kL = k. (We may take 
L = Q P (C), where ( is a primitive (\k\ — l)th root of 1.) 

We consider the inverse system consisting of groups Gal(k/ F p ), 
where k/F p runs through finite Galois extensions, together with 
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the usual restriction maps. Then {Gq p /Gq, {4>k}) is an object of 
the new category associated to this inverse system. We obtain 
a homomorphism 

G Qp /G 0 lim Gal(k/F p ) ^ Z, 

and this is an isomorphism because of (1) and (2) above. 

Next we study the structure of Gq/G\. We have the following 
diagram: 



Note that G\ lies in the kernel of each ipL- It is a simple exercise 
to show that the norm map k f — > kf 4 makes the bottom square 
commutative. 

Theorem 3.13 There is a canonical isomorphism Gq/G\ = 
lim k x . 

Proof: Let L/Q p be a finite Galois extension, and recall that 

g[ l) /g[ l) ^ kf 

naturally. Then 

v WG . i-x 

factors through Gq/G\. In this way we get maps Gq/G\ — > 
kf for each L, and finally a map Gq/G\ — > lim k : x , where the 
inverse limit is taken over all finite Galois extensions kf F p , and 
for k\ C A,“ 2 , the map kf —> kf is the norm map. The fact that 
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Gq/Gi — > liin k x is an isomorphism, follows by exhibiting fields 

L/Q p with Gq^/Gi^ = k x for any finite field kj F p , namely 
L = QpiCiP 1 ^ 1 ) where d = \k\ — 1 and ( is a primitive dth root 
of 1 . QED 

Note: the helds exhibited above yield (surjective) fundamental 
characters Gq/G\ —> k x . Let p n be the group of nth roots of 1 in 
F p , where (p, n) = 1. For m|n, we have a group homomorphism 
Hn Pm given by x i— > x n / m , which forms an inverse system. 
Let fi = lim fi n . 

Lemma 3.14 We have a canonical isomorphism lim/c x = /i . 


Proof: Since the groups k x form a subset of the groups fi n , we 
obtain a surjection from lim p n — > lim k x . To obtain a map in 
the other direction, note that the numbers pf — 1 are cofinal 
in the set of integers prime to p. In fact, if d is such an integer 
integer, there is some f > 1 with pf = 1 (mod cl), e.g. / = 
c p{d). QED 

/i is noncanonically isomorphic to U q ^ p Z g , since p n is non- 
canonically isomorphic to Z/nZ, and 


limZ/nZ = Z q , 

<rhv 


since Z/nZ = YlZ/pfZ for n = n p r f ■ 
We now examine the map 

G[ L) /G[ L) ^ k x L . 


One can check that it is O^^-equi variant (with the conjugation 
action on the left, and the natural action on the right). Now 
Gq L ^ acts trivially on the left (G^ / g[ L ^ is abelian ), and also 
trivially on the right by definition. Hence we end up with a 
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G^/G { 0 L \= Galfk^j F p ))-equivariant map. The canonical iso- 
morphism Go/Gi — > lim k x is Gq p / Go-cquivariant. 

The upshot is that Gq p /G\ is topologically generated by 2 
elements x, and y, where x generates a copy of Z, y generates 
a copy of Uq^p Z q , with one relation x~ l yx = y p . This is seen 
from the short exact sequence of groups: 

1 - G 0 /G l - G Ct JG l - G Qp /G 0 - 1, 

in which Gq/G\ = Yl q ^ p Z g , and Gq p /Gq = Z are both procyclic 
(i.e topologically generated by one element). Since Z is free, 
the sequence is split, i.e. defines a semidirect product with the 
action given as above. 

Next, let p : G q — > GL n (k ) be given, where k is a finite held of 
characteristic £. The image of p is finite, say Gal{Kj Q), where 
K is a number held. Letting the hnite set of rational primes 
ramified in K/Q be S, we see that p is unramihed at every 
p 0 5, i.e. Pp(Go ) = {1} and so p p factors through Gq p /Gq for 
all p £ S. Consider next p £ : Gq t — > GL n {k). 

Call p(, semisimple if V := k r \ viewed as a ^[^Qj-module, is 
semisimple, i.e. a direct sum of irreducible modules. 

Theorem 3.15 If pt is semisimple, then pi(G i) = {1}, and so 
Pi factors through GqJGi. 

Proof: Assume V is irreducible, i.e. has no proper ^GciJ-submodules. 
Let V' = {v G V\g(v) = ufor dllg e pi{G\)}. Since G\ is a pro- 
£ group, pi(G i) is a hnite A group and so its orbits on V are 
of length 1 or a power of £. The orbits of length 1 comprise 
V' , so that \V'\ = \V\ (mod £) = 0 (mod £). In particular, 

V 7^ {0}. Since G\ is normal in G^ n V' is stable under G^ n 
implying by irreducibility of V that V' = V. Thus Pi(G i) acts 
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trivially on the whole of V. For a semisimple V, we apply the 
above to each summand of V. QED 

The Big Picture. We have defined subgroups G q p (one for 
each prime p) of G q with much simpler structure than G q itself. 
This will allow us to describe representations p : G q — > GL n (R ) 
in terms of their restrictions p p to these subgroups. Each p p can 
be described in turn by its effect on ramification subgroups of 
G q , ultimately enabling us to define useful numerical invari- 
ants associated to p (see chapter 5). First, however, we need 
some natural sources of Galois representations p and these will 
be provided by elliptic curves, modular forms, and more gen- 
erally group schemes. <f> 
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Galois Representations from Elliptic Curves, 
Modular Forms, and Group Schemes 


Having introduced Galois representations, we next describe nat- 
ural sources for them, that will lead to the link with Fermat’s 
Last Theorem. 


4.1 Elliptic curves 


An elliptic curve over Q is given by equation y 2 = f(x), where 
/ G Z [x\ is a cubic polynomial with no repeated roots in Q. A 
good reference for this theory is [32] . 


Example: y 2 = x(x — l)(x + 1) 

If K is a field, set E(K) := {(x,y) £ KxK\y 2 = /(x)}U{oo}. 
We begin by studying E( C). 

Let A be a lattice inside C. Define the Weierstrass p- function 

by 


p(z;A) 


3+ E 


z t 


o^weA 



1 




) 
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and the 2kth Eisenstein series by 

G 2k ( A) = £ 

OpueA 

These arise as the coefficients in the Laurent series of p about 
z = 0, namely: 

p(z) = —z + ?>G^z^ + 5(j6^ 4 T 7G 8 z 6 + . . . , 
z A 

which is established by rearranging the infinite sum, allowed by 
the following. 

Proposition 4.1 The function p is absolutely and locally uni- 
formly convergent on C — A. It has poles exactly at the points 
of A and all the residues are 0. G^A) is absolutely convergent 
if k > 1 . 

Definition 4.2 A function f is called elliptic with respect to A 
(or doubly periodic) if f(z) = f(z-\-cu) for all wgA. Note that 
to check ellipticity it suffices to show this for cu = U\,U 2 , the 
fundamental periods of A. An elliptic function can be regarded 
as a function on C/A. 

Proposition 4.3 p is an even function and elliptic with re- 
spect to A. 

Proof: Clearly, p is even, i.e. p(z) = p(—z). By local uniform 
convergence, we can differentiate term-by-term to get p'(z) = 
— 2EcjgA us ? an d so p' is elliptic with respect to A. Integrat- 
ing, for each weA, p(z + c o) = p(z) + C( u), where C{uf) does 
not depend on z. Setting z = — oy/2 and u = u>i (i = 1 or 2) 
gives 

p(ui/2) = p(—Ui/2) + C{ui) = pipJi/2 ) + C{ui) y 
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using the evenness of p. Thus, C(cUi) = 0 (i = 1,2), so p is 

elliptic. QED 

Let g 2 = 60 G 4 (A) and g% = 140^6 (A). 

Proposition 4.4 ( p'(z )) 2 = 4 p{zf - g 2 p{z ) - g^z A).(*) 

Proof: Let f(z ) = (p'(z)) 2 — (ap(z) 3 — bp{z) 2 — cp(z) — d). Con- 
sidering its explicit Laurent expansion around 2 = 0, since / is 
even, there are terms in z~ 6 , z~ 4 , z~ 2 , z° whose coefficients are 
linear in a, 6, c, d. Choosing a = 4, b = 0, c = g 2l d = gs makes 
these coefficients zero, and so / is holomorphic at 2 = 0 and 
even vanishes there. We already knew that / is holomorphic 
at points not in A. Since / is elliptic, it is bounded. By Liou- 
ville’s theorem, bounded holomorphic functions are constant. 
/( 0) = 0 implies this constant is 0. QED 

Theorem 4.5 The discriminant A (A) := g\ — 27 g\ 0, and 

so 4ic 3 — g 2 x — c /3 has distinct roots, whence E\ defined by y 2 = 
4ic 3 — g 2 x — c /3 is an elliptic curve (over Q if c/ 2 , #3 C q; 

Proof: Setting o ; 3 = uji+uj 2 , p'{ui/ 2) = -p'(-c^/2) = -p / (cj ?: /2) 
since p' is odd and elliptic. Hence p'iui/ 2) = 0. Thus 4p(cj ? ;/2) 3 — 
g 2 p(ui/2) — g 3 = 0. We just need to show that these three roots 
of 4ic 3 — g 2 x — ^3 = 0 are distinct. 

Consider p(z) — p(ui/2). This has exactly one pole, of order 2, 
and so by the next lemma has either two zeros of order 1 (which 
cannot be the case since this function is even and elliptic) or 1 
zero of order 2, namely Ui/2. Hence p(uJi/ 2) — p(u>j/ 2) 0 if 

QED 

Lemma 4.6 If f is elliptic and v w (f) is the order of vanishing 
of f at w, then E we c/A v w(f) = 0. Moreover, the sum of all the 
zeros minus the poles is 0 (mod A) . 
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Proof: By Cauchy’s residue theorem, E w€ c/a v w (f) = ^ / ^ dz , 
where the integral is over the boundary of the fundamental par- 
allelogram with vertices 0, coi, 0^2 , CJ3 (if a pole or zero happens 
to land on the boundary, then translate the whole parallelo- 
gram to avoid it). By ellipticity, the contributions from parallel 
sides cancel, so the integral is 0. The last statement is proved 
similarly, using ^ / z^jdz. QED 

Theorem 4.7 There is a bijection (in fact a homeomorphism 
of Riemann surfaces) <f : C/A — > E\(C) given by 

z 1— » (p(z), p'(z))(z A), z 1— > oc(z e A). 

Proof: Ellipticity of p and p> implies that <f is well-defined and 
(*) shows that the image is in C). To show surjectivity, 
given (x,y) G E\(C) — {00}, we consider p(z) — x, a noncon- 
stant elliptic function with a pole (at 0) and so a zero, say at 
z = a. By (*), p'( a ) 2 = y 2 . By oddness of p' and evenness of p, 
we see that <f>{a) or <f(—a) is (x,y). 

To show injectivity, if 4 >(z\) = ^(^2) with 2 zi ^ A, then con- 
sider p(z) — p(z\), which has a pole of order 2 and zeros at 
Zi, —Zi, Z2, so Z2 = Ez 1 (mod A). If also (f>(z\) = <j>'(z2), then 
this fixes the sign. QED 

Note that this bijection from C/A, which is a group (in fact a 
torus), puts a group structure on E\( C), so that it is isomorphic 
to R/Z x R/Z. Our desired Galois representations will come 
from Galois actions on certain finite subgroups of E\(C). Later, 
we shall see that every elliptic curve over C is of the form E\. 

Theorem 4.8 The group law on E\(C) is given by saying that 
three points P\, P2, P3 add up to the identity, 00, if and only if 
Pi, P2, P3 are collinear. If two of the points coincide, this means 
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the tangent at that point. 

Proof: Fixing Zi, z 2 , let y = mx + b be the line through P\ = 
and p 2 = {p(z 2 ),p'(z 2 )). 

Consider f(z ) = p'(z) — mp(z) — b, which has one pole of 
order 3 at 0, whence three zeros. Two of these are z\,z 2 - let 
the third be z 3 . Since the zeros minus poles equals 0, we get 
z 1 + z 2 + z 3 = 0. Thus, if P 3 = (p(z 3 ), p'(z 3 ), then P 1 + P 2 + P 3 = 
00 . QED 

The importance of this is that it means that the coordinates 
of (xi, y\) + (x 2 , y 2 ) are rational functions in x\, x 2 , yi, y 2 , g 2 , g 3 . 
This yields a group structure on E(K ) whenever g 2 , g 3 G K, 
since e.g. the associative law is a formal identity in these ratio- 
nal functions. 

Definition 4.9 Given an elliptic curve E over Q and positive 
integer n, the n-division points of E are given by E[n\ := {P G 
E(C)\nP = 00}. 

Example: If E is y 2 = fix) G Z[x], let cq (i = 1,2,3) be 
the roots of fix) = 0 and Pi = (cq,0). The tangent at Pi is 
vertical and so Pi, Pi, 00 are collinear, whence 2 Pi = 00. Thus 
E[ 2] = { 00 , Pi, P 2 , P 3 }. (No other point has order 2 by the 
following.) 

Lemma 4.10 Let E be an elliptic curve over Q. Since C/A = 
R/Z x R/Z, E [n\ = Z jn x Z/n. Moreover, there are polyno- 
mials f n G Q[x] such that E[n\ = {(x,y)\f n {x) = 0 } U {00}. 

Proof: The elements of R/Z of order dividing n form a cyclic 
group of order n. The f n come from iterating the rational func- 
tion that describes addition of two points on the curve. QED 
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Example: Continuing the case n = 2, we see that /2 = /. 

By the last lemma, E[n\ C E(Q) and if P = (x, y ) G E'fn] and 
a G Cq, then <j(P) = (a(x),a(y) G C[n]. This action of Gq 
on E[n\ = Z/n x Z/n produces a homomorphism pE, n '■ G q — > 
GL^iZfn). These are the Galois representations associated to 
E. 

Let £ be a prime. Consider the inverse system consisting of 
groups GL2(Z/£ n ) together with the natural maps between 
them. An object of the corresponding new category is given 
by (Gq,{pej"}), yielding a homomorphism p E jn*> : Gq — > 
CL 2 (Zf), called the t-adic representation associated to E. An- 
other way of viewing this is as the Galois action on the i-adic 
Tate module T^{E) = lirri E [£ n ] , where the maps in the inverse 
system are E[£ n ] — > E[£ m } for n > m dehned by P i— > £ n ~ m P. 

Example: Continuing the case n = 2, note that GL^iZfT) = A3, 
the symmetric group on 3 letters. The action of G q on E[ 2] = 
{00, (ai, 0 ), (q!2, 0 ), (q!3, 0 )} amounts to permuting the roots oi{ 
of /, and so the image of pE.2 is Gal{Kf Q) < A3 where K is 
the splitting held of /. 

Given a typical E (meaning one with no complex multiplica- 
tions), ps,e 00 is surjective for all but finitely many primes £. In 
fact it is surjective for all £ for a set of elliptic curves of density 
1, for example if E is the curve y 2 + y = x 3 — x (elliptic - you 
can complete the square). 

These Aadic Galois representations encode much information 
about elliptic curves. For example, for a fixed £, pE/°° and pE',e°° 
are equivalent (conjugate) if and only if E and E' are isogenous. 
Later on, we shall study in great detail the Aadic representa- 
tions associated to semistable elliptic curves. 
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4.2 Group schemes 

An elliptic curve E defined over Q yields groups E(A ) for any 
Q-algebra A. This can be usefully generalized as follows. Ulti- 
mately we shall define finite, flat group schemes and see that 
they provide a quite general source of Galois representations. 
A good resource for this section is [37]. 

Definition 4.11 Let R be a commutative ring with 1. An affine 
group scheme over R is a representable functor from the cate- 
gory of R- algebras (i.e. rings A together with a homomorphism 
R . — > A with morphisms ring homomorphisms that make a com- 
mutative triangle over R) to the category of groups. 

Recall that a functor F is representable by the A- algebra 3ft if 
and only if F(A) = hom#_ a / g (3ft, A). In general, a representable 
functor from the category of A- algebras to the category of sets 
actually defines an affine scheme over A - the added group 
structure gives the representing ring 3ft the structure of a Hopf 
algebra. 

Example: (i) The functor G a given by G a (A) = A + is repre- 
sentable since A + = hom n- a i g (R[T], A) (an A-algebra homo- 
morphism from R[T] to A is determined by whatever T maps 
to, and this can be any element of A). 

(ii) The functor G m given by G m (A) = A x (the units of A) is 
representable since A x = hom n_ a i g (R[X, Y]/(XY — 1), A) (un- 
der an A-algebra homomorphism, X has to map to something 
invertible in A and then Y maps to its inverse). 

(iii) More generally, the functor GL n is representable since 
GL n (A) = hom R - a i g (R[T lh T u , ..., T nn , Y]/{det{{fT ij ))Y-l),A). 
Note Cj L\ = G m . 
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(iv) The functor fi n defined by /d n (A) := {x G A\x n = 1} is 
representable since fr n (A) = hom#_ a ; ff (i?.[T]/(T n — 1), A). fi n is 
a subgroup scheme of G m . 

The following example shows how certain group schemes can 
give rise to Galois representations. It will be generalized below. 

Definition 4.12 Let K be a field, £ char K. Then Gk acts 

on mn(K) = Z/£ n . This yields a representation Xt n '■ Gk 
GL\(Z/ £ n ) = (fZ/£ n ) x . Putting these together yields xt°° '■ Gk — > 
GL i(Zf) = Zf , called the i-adic cyclotomic character. As with 
elliptic curves, we can provide an alternative deflation by do- 
ing the inverse limit before the Galois action; namely letting 
Tflfi) := lim /j,g n , then xe°° gives the action of Gk on the Tate 
module Tflji) . 

Exercise: Let K = Q. Show that the Gadic cyclotomic char- 
acter y is unramified at all primes p i, i.e. Xp '■ G q p — ■» Zf 
factors through the inertia subgroup Gq. This then induces a 
map Gal(F p /Fp) = GqJGq — > Z| . Show that the image of the 
pth Frobenius element is p. (Hint: note that a(() = for 
any t - power root of 1 in Q.) 

Exercise: Some universal ring constructions. 

(i) Given ring homomorphism R — > S and iGalgebra A, con- 
sider the collection of rings B and homomorphisms that make 
the following diagram commute: 

,1 - -A.P 


R 


S 
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Show that these form the objects in a category of S-algebras, 
with an initial object. This initial object is the tensor product 
A ®r S. In particular, i?[Ti, . . . , T n ] ®r S = S[Ti, . . . , T n \. 

(ii) Given x G R , consider the collection of i?-algebras A such 
that under R — > A, x maps to a unit in A. Show that these 
form the objects in a category, with an initial object. This initial 
object is the localization of R at x. 

The case of (i) we shall be most interested in is where R = Z^. 
If A = Z™ x T ( T torsion) as a Z^- module, then A<S>rQ( = Q” 
as a Q<?-module and A <S>r F<? = F ™ x T/£T as an F^- module. 

Let <fi : R — > S be a ring homomorphism. Given a functor 
F on A-algebras, we get a functor F' on S'-algebras since via 
composition with f every A-algebra S — > A is an A-algebra. If 
3ft is an A-algebra, then hom ( s_ a ; 3 (3ftG j R<S', A) = honiR_ a / 3 (3ft, A) 
(adjoint associativity - follows quickly from universal descrip- 
tion of tensor above), so if F is representable (by A-algebra 3ft), 
then F' is representable (by S'-algebra 3 ft 0^5). This process is 
called base change. 

A homomorphism of affine group schemes over R is a natu- 
ral map G — > H. This gives us for each R- algebra A a group 
homomorphism G(A) — >• H(A) such that whenever A — > B 
is a homomorphism of i?-algebras, the following diagram com- 
mutes: 


G(A) — *H(A) 
G(B) — *H(B) 


Here the vertical maps come from G and H being functors. 
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Exercise: For each /?- algebra A, define E(A) = ker(G(A) — > 
H(A)). Show that F is a group scheme over R. 

For example, the determinant map gives a homomorphism 
from GL n to G m with kernel SL n , and the map x i— > x n a 
homomorphism from G rn to G m with kernel fi n . It requires a 
lot more work to give the cokernel a functorial description. 

We now have the objects and morphisms of the category of 
affine group schemes over a fixed ring R. An elliptic curve 
E : y 2 = f{x) over Q gives for each Q-algebra A a group 
E(A) and for each Q-algebra map A —> B a group homomor- 
phism E(A) — > E{B). Is it an affine group scheme? Actually 
not - the obvious try is 3ft = Q [x,y\/(y 2 — fix)), in which case 
homQ_ a ^(3ft, A) yields the points (x,y) with coordinates in A 
satisfying y 2 = fix). This, however, misses the point at oo. In 
other words, it shows that E{A) — {oo} defines an affine scheme. 
The answer is to define group schemes in general, obtained by 
patching together affine schemes called charts. For instance, 
E(A) — {oo} provides such a chart. We say more about non- 
affine group schemes later. 

It turns out that the n-division points are nicer, since they 
define affine group schemes. 

Definition 4.13 An R-algebra A is called finite if it is finitely 
generated as an R-module. (Note that this is stronger than being 
finitely generated as an R-algebra - consider e.g. R[T].) Let G 
be an affine group scheme over R. We call G finite over R if 
its representing ring 3ft is a finite R-algebra. 

In fact, any group scheme finite over R is affine. In particular, 
if E : y 2 = f(x) is an elliptic curve over Q and £ a prime such 
that f(x) (mod £) has distinct roots, then E[n\ is a finite (of 
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rank n 2 ) affine group scheme over Q p . (It can be shown directly 
to be affine by finding a polynomial / whose zero set V((f)) - 
see below - does not meet E [n] , whence E[n\ lies in an affine 
chart Uf. See Conrad’s article in [5] for details. A nice explicit 
description of E[n\ can be found in [9].) This will be discussed 
in the section on reduction of elliptic curves in the next chapter. 

Next, we make explicit the kind of group schemes that yield 
Galois representations useful to us. 

Definition 4.14 Let G be a finite group scheme over a field 
K, e.g. represented by K -algebra 3ft. We call G etale «/3ft is an 
etale (or separable ) K -algebra, i.e. 3? K = K x ... x K. 

Example: Let G = /^, £ prime. We saw that 3ft = K[T ]/ (T f: — 1). 
Then 3ft ® K K = K[T]/(T e - 1). If char K = £, then this is 
K [T]/(T— l)f which contains nilpotents and so G is not etale. If 
char K 1, then by Chinese remainder this is Ip=o AT[T] /{T — 
( l ) = K x ... x K, and so G is etale. 

Note that the etale situation here is that in which we ob- 
tained useful Galois representations earlier, namely the cyclo- 
tornic characters. This is clarified as follows (more details can 
be found in [37]). 

Theorem 4.15 The category of etale group schemes over K 
is equivalent to the category of finite groups with Gk acting 
continuously. 

Proof: Given an etale group scheme G, represented say by 3ft, an 
etale Id- algebra, consider the group G(K) = hom^_ a / 5 (3ft, K). 
Since 3ft is etale, | hom^_ a / 5 (3ft, K)\ = dim^3ft, so G{K) is a 
finite group. Given a Id- algebra 3ft — > K and o G Gk , the 
action is given by composing to get 3ft — > K K. The images 
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of 3? all lie in some finite Galois extension of K and so the 
action is continuous. 

Conversely, given a finite group H with continuous G#-action, 
consider first the case of transitive action, say H = Gxh. By 
continuity, choose finite extension L of K such that the action 
of Gk factors through Gal(L/K). Let S be the subgroup fixing 
h and 3ft C L its fixed held. By Galois theory, all maps 3ft — > K 
map to L and are conjugate, yielding a G^-isomorphism H — > 
hom^_ a ^(3ft, K) by sending h to one of them. If the action is 
intransitive, we obtain for each orbit a ring 3ft; and then n 3ft; 
works. 

QED 

Definition 4.16 An R- algebra A is called flat if tensoring with 
A is an exact functor, i.e. if for any short exact sequence 0 — » 
M — > N — > L — > 0 of R-modules, 0 — > M <S>r A — > N <S>r A — > 

L <S>r A — > 0 is also exact. A finite group scheme over Spec R 
is called flat if its representing ring A is a flat R-algebra. 

If R = Z^, then finite hat is equivalent to A being free of hnite 
rank over R. 

Let p : Gq ( — > GL n ( Z/t m ). The above theorem associates to 
p an etale group scheme G over Qf. Call p good if there is a 
hnite hat group scheme F over Z<? such that the base change of 
F under Z^ — » is G. This will be needed in defining when a 

Galois representation is semistable at t in the next chapter. 

4.3 General schemes 

We next reconcile the above with the usual approach to schemes. 
We begin by defining affine schemes. 
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Let R be a commutative ring with 1 and Speci? denote the set 
of prime ideals of R. For example, SpccZ p = |(0),pZ p }. Then 
Speci? comes with a topology, the Zariski topology , defined by 
having the closed sets be the sets V{I ) as / runs through all 
ideals of R , where 

V(I) := {p £ Speci?|/ C p}. 

Exercise: Show that this does indeed define a topology on Speci?, 
i.e. that 0, R are closed sets and that arbitrary intersections and 
finite unions of closed sets are closed. Show that SpecZ^ is not 
Hausdorff. Show that Speci? is always compact. 

If / : R — > S is a ring homomorphism and p is a prime ideal 
of S, then i?// _1 (p) — > S/p is an injective homomorphism into 
an integral domain, and so / _1 (p) is a prime ideal of R. Thus 
/ induces a map Specs' — > Speci?, which can be checked to be 
continuous with respect to the Zariski topologies. For example, 
if / is an ideal, then since the prime ideals of R containing / are 
in bijection with those of R/I, Spec (R/I) — > Speci? is an injec- 
tion with image V (/). If x (E R, then likewise Specif — > Speci? 
(from localization) is injective with image Speci? . — V((x)). We 
thus think of Speci? as a ringed space by having R x be the ring 
of functions on this basic open set. 

We shall identify Spec3£ with the affine scheme represented 
by 3L This is fair since to each /Falgcbra A, Spec3^ associates 
the set Spec(y4 <S>r 3ft), closely related to (but not exactly the 
same as) hom^_ a ^(3ft, A). 

Note that if A is an ift-algebra, then the map R — » A induces 
a map Spec^l — > Speci?, and SpecA will be called an affine 
scheme over Speci? via this map. Base change to Spec3ft replaces 
Spec^l by Spec(v4<g)i?3ft). An A- algebra homomorphism A — > B 
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Spec B 


SpecA 



Spec A 


i.e. a morphism of affine schemes over SpecA. The category 
of affine schemes over SpecA is hereby anti- equivalent to the 
category of A- algebras. 

Exercise: Let p G SpecA and let K,(p) = Frac (R/p). Show that 
k(p) is an A- algebra, so that Spec n(p) embeds in SpecA. Let 
A be an A-algebra, so that SpecA is a cover of SpecA. Show 
that its fibre over p can be identified with Spec(A <S>r k(p)- 

A scheme then is a ringed space admitting a covering by open 
sets that are affine schemes. Morphisms of schemes are defined 
locally, i.e. / : S' — >• S is a morphism if there is a covering 
of S by open, affine subsets Specif such that / _1 (SpecA;) is 
an affine scheme Spec A' and the restriction map SpecA' — > 
SpecA; is a morphism of affine schemes. We say that S' is a 
scheme over S. In Grothendieck’s approach, this relative notion 
is important rather than absolute questions about a scheme. 
Questions about / turn into questions about the ring maps 
fi : Rj — > A'. In particular, we say that / has property (*) (for 
example, is finite or flat), if there is a covering of S such that 
each of the ring maps fi has this property. 

If A is a scheme, then a group scheme over S is a representable 
functor F from the category of schemes over S to the category 
of groups, i.e. there exists some scheme S over S such that 
F(X) = hom s - schemes (X, S). For example, an elliptic curve 
over Q is a (non- affine) group scheme over SpecQ. 
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4.4 Modular forms 


Another source of Galois representations (in fact, which turns 
out to produce all that we are interested in) is modular forms. 
For this section, [28], [19], and [11] are recommended. Note that 
elliptic curves will ultimately correspond to modular forms of 
weight 2 and trivial Nebentypus, and so those forms will be 
highlighted. 

Fix positive integers k, N and homomorphism e 
C x . Let H = {ze C|Im(z) > 0}. For a = ' a 6 


c d 


Z/N) x -► 

e sl 2 ( z), 


set az = 


Set 


and 


az+b 

cz+d 


and f\[a] k (z) = (cz + d) k f(oz). 


r o(JV) := { 


Vi(JV) = { 


a b 
c d 

a b 
c d 


<E SL- 2 (Z)\c = 0 (mod IV)} 


G r 0 {N)\d= 1 (mod N)}. 


Definition 4.17 A modular function of weight k, level N, and 
Nebentypus e, is 

(i) a meromorphic function on Ti, 

(ii) satisfies /|[<r]fc = e(d)f for all a G Fq(A^) ; 

(in) is meromorphic at all cusps. 

If the form is holomorphic on TL and at the cusps, then it is 
called a modular form. If it vanishes at the cusps, then it is 
called a cusp form. We now explain (Hi). 

Note that by (ii) f(z + 1) = f(z). Thus, f has a Fourier 
expansion in terms of q = e 2iriz , say f = Ta n q n . 

We say that f is meromorphic (respectively, holomorphic, 
vanishes) at oc if a n = 0 for all n < some no (respectively 
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n < 0, n < 0). We say that f is meromorphic (respectively, 
holomorphic, vanishes) at the cusps if f\[cr]k is meromorphic 
(respectively, holomorphic, vanishes) at oc for all a G SL 2 (Z). 

For example, if N = 1, then T 0 {N) = T^N) = SL 2 ( Z). If 
/ is a modular form of this level, then its Nebentypus must 
be trivial. Moreover, f\[cr\k = / for all cr G SL 2 (Z) and so the 
cusp condition only need be checked at oo. In general, one has 
finitely many conditions to check, taking a running through the 
finitely many cosets of To (IV) in SL 2 (Z). 

The set of modular forms (respectively cusp forms) of weight 
k, level N, and Nebentypus e will be denoted M k (N,e) (re- 
spectively S k (N,e)). As will be shown later, these are finite- 
dimensional C-vector spaces. 

Exercise: Show that if /, f are modular functions of weights 
k,k' respectively and level 1, then are modular func- 

tions of weights k + k', k — k! respectively and level 1. Show that 
for A G C, A / and if k = k', then / + f are modular functions 
of weight k, level 1. 

Example: Let A be the lattice in C generated by fundamental 
periods 1, r, where r G H. G 2k {r ) = G 2k ( A) = E( m , n )^(o,o) (mT + n)2fc 
(the Eisenstein series) is a modular form of weight 2k and 
level 1. (ii) follows since G 2k (XA) = \~ 2k G 2 k(A) and (iii) fol- 
lows since uniform convergence allows passage to limit term 
by term, the m 0 terms giving 0, the m = 0 terms giving 
£n#o n- 2k = 2C(2 k). 

For this same A, A = — 27 is therefore a modular form 

of weight 12 and level 1. Using the known values of £(4), £(6), 
we get its constant coefficient 0, and so it is a cusp form. In 
fact, looking deeper shows that A = (27r) 12 q II^Li(l — q n ) 24 [28]. 
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Henceforth, we shall normalize A so that its first coefficient is 

1 . 

Another important example is j(r) := 17 ^ g2 , which is a mod- 
ular function of weight 0 and level 1, and thus defines a map 
j : SL 2 (Z)\H — > C. Its Fourier expansion ^ + 744+ 196884^ + 
. . . has fascinating connections with the Monster finite simple 
group. 

The best way to think of modular forms is in terms of asso- 
ciated Riemann surfaces, called modular curves. 

4-4-1 Riemann surfaces 

A surface is a topological space S which is Hausdorff and con- 
nected such that there is an open cover {U a \a G A} and home- 
omorphisms <f> a of U a to open sets V a C C. Then (U a , </> a ) is 
called a chart and the set of charts an atlas. If U a fl Up ^ 0, then 
transition function t a p = fpfh 1 ■ <t>a(U a fl Up) — >■ (f)p(U a fl Up). 
Call S a Riemann surface if all the t a p, where dehned, are an- 
alytic. 

Example: (i) Let Cqo = C U {oo}. Take the topology with open 
sets those in C together with {oo} U (C — K) (K compact 
in C). Let U a = C,<f> a (z) = 2 ; and Up = - {0 },<f)p(z) = 

1 j z[z G C), fp ( 00 ) = 0. These two charts make Cqo a compact 
Riemann surface, identifiable with the sphere. 

(ii) If A is a lattice in C, then C/A is a compact Riemann 
surface. 

We now introduce another useful class of compact Riemann 
surfaces: 


Definition 4.18 Let hi* = hi U Q U {oo} ; and put a topology 
on hi* by taking the following as basic open sets: 
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(i) about a point in Tt any open disk entirely inside hi; 

(ii) about oo, {Imr > r} for any r > 0/ 

(Hi) about x G Q, D U {ic} ; where D is any open disk in hi 
of radius y > 0 and center x + iy. 

Extend the action of a = ^ ° ^ j on hi to act on x = \x, 1] G 

Q and oo = [1,0] by a[x,y\ = [ax + by,cx + dy\ G P 1 (Q) 
(i.e. homogenized). Let YfN) = F C F = 
Xi(N) . These are compact Riemann surfaces and called modu- 
lar curves. See [19] p.311 or [11] p.76 for more details. 

Riemann surfaces form a category where the morphisms are 
analytic maps defined thus. 

Definition 4.19 Call a continuous map f : R — > S of Rie- 
mann surfaces analytic if the maps f’gfff 1 are analytic maps 
on domains in C ; wherever defined. An analytic map f : R — > 
Cqo is a meromorphic function on R (matches the usual defi- 
nition, setting f(p) = oc ; if f has a pole at p). The collection 
of all such functions is a field, the function field K(R). 

Example: (i) The meromorphic functions on torus C/A are just 
the elliptic functions with respect to A (in fact = C(p, p')). 

(ii) The meromorphic functions on modular curve Xq(N) are 
just the modular functions of weight 0, level N, and trivial 
Nebentypus. 

The most important result here is the Open Mapping The- 
orem, stating that any analytic nonconstant map of Riemann 
surfaces maps open sets to open sets. 

Exercise: Using this, show that if / : R — >• S is one such and R 
compact, then f(R) = S, and so S is compact. Deduce Liou- 
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ville’s theorem. 

Moreover, show that / is a A;-to-l map for some k (hint: let 
S m C S be those points with precisely m preimages, counting 
multiplicity. Show that S m is open and then use compactness 
and connectedness of S). 

Lemma 4.20 The function j is surjective. 

Proof: Consider j as a map Xq( 1) — > Coo- Since A has a simple 
zero at oc and no others (as we showed in establishing E\ is an 
elliptic curve), j has a simple pole at oc and no others. Thus, 
invoking the exercise above, j is 1-to-l. QED 

Corollary 4.21 Every elliptic curve over C is of the form E\, 
for some lattice A. 

Proof: Given y 2 = /( x) with / £ C[x] a cubic with distinct 
roots, we can, by change of variables, get y 2 = 4ic 3 — Ax — B 
for some A, B £ C. The claim is that there exists A such that 

c/2 (A) = A, /73(A) = B. From the definition of j above, we 

2 

can find r such that ^ takes any value other than yj. Pick 

r such that this equals (7^ 2 7 s i nce / has distinct roots). 
Choose A such that <72 CO = A 4 A. Then g 3 (r) 2 = A 12 B' 2 , so 
gz{r) = ±A 6 B. If we have the negative sign, then replace A by 
iX. Noting that Eisenstein series satisfy by definition ^(AT) = 
A _4 ^2(-h), gs(XL) = A' 6 £ 3 (£), we are done if we take A = XL 
where L has basis 1 ,r. QED 

Note that by the last theorem, j identifies A 0 (l) with C U 
{oc}. In fact: 

Theorem 4.22 The modular functions of weight 0 and level 1 
are precisely the rational functions of j , i.e. the function field 
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K(X o(l)) = C(j). 

Proof: An earlier exercise showed that rational functions in j 
are modular functions of that weight and level. Conversely, if / 
is such a function, say with poles ry counted with multiplicity, 
then g = — j{rj)) is a modular function of weight 0 

and level 1 with no poles in Tt. If g has a pole of order n at oo, 
then there exists c such that g — cj n has a pole of order n — 1 
at oo (and no others). By induction, g minus some polynomial 
in j has no pole in 7 L C, and so is constant. Thus, g, and so 
/ e C (j). QED 

Letting cq run over all integer matrices 

with ad = N, d > 0,0 < b < d, gcd(a, b, d) = 1} 

(of which there are n{N) := N U p \n0- + \) suc h matrices), the 

modular polynomial of order N is <&n(x) = Ui=i\x—joai). One 

root is j N := j o a , where a = ^ J j (i.e. j N (z ) = j(Nz)). 

In fact, n(N) = [SL- 2 (Z) : To (AT)] and so is the degree of the 
cover Xq(N) — > Aq( 1). The corresponding extension of func- 
tion fields I\(X o(l)) C I\(Xq(N)) is then also of degree g(N). 
K(Xq(N )) turns out to be A(Ao(l))(.bv): 

Theorem 4.23 See [19], p. 336 on. 4>at(x) has coefficients in 
Z \j\ and is irreducible over C (j) (and so is the minimal polyno- 
mial of jn over C(j)). The function field K(Xq(N)) = C(j, Jn) ■ 

This enables us to define Xq(N), a priori a curve over C, over 
Q. This means that it can be given by equations over Q. If 
N > 3, then one can further define a scheme over Z[1/AT], so 
that base change via Z [1/iV] — > Q yields this curve (in other 
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words, we have a model for Xq(N) over Q with good reduction 
at primes not dividing N ). 

Proof: First, we note that j, jw are indeed in K(Xq(N)), i.e. 
satisfy f(z ) = f{pz ) for all a £ To (AT). This clearly holds for 
j since j is a modular function of weight 0 on all of SZ^Z). 
Since jN{az) = j o a o a(z) = jo (aaa~ 1 )a(z) and olooT 1 = 

( A A c ^ e ^( Z ), i ° a ( z ) = 3 n { z ). The 

condition at the cusps is easily checked. 

Next, we note that joafl < i < g(N)) are distinct functions 

on H. If a i = ^ o h ( l )’ then J ° ai ^ = * + ■■■, 

where = e 2mz ! d , Crf = e 2m ^ d . If jocq = jocq/, take the quotient 
and let Imz — > 0 to get = Qd'Cd'- So a/d = a' /d! , but since 
ad = N = a!d! and all are positive, a = a' ,d = <f, whence b = b' 
too. 

Next, we show the properties of Ttv If 7 £ ST^Z), then 
we check that cqy = (doth for some /c and (3 £ ST^Z). Thus 
j o cq o 7 = j o Q!fc, whence 7 permutes the roots of Tyv, and 
so its coefficients are invariant under 6X2 (Z), hence £ C(j) 
(meromorphic since polynomials in the j oaf). In fact, one easily 
computes that 6X2 (Z ) acts transitively on the roots of <f>yv, 
whence <Fyv is irreducible over C (j). To show its coefficients he 
in Z[j], we see that they lie in Z [Qv] since d \ N. Automorphism 
Cn >— > Cn ( an y r coprime to N) permutes the j o cq, and so the 
coefficients he in Qnz[c«] = z. 

QED 

These polynomials <f>yv tend to have huge coefficients, but at 
least they define Xq (N) as a curve over Q. 

If A is a compact Riemann surface of genus g and IT = 
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n hd (X) its holomorphic differentials, then W and so V = 
hom(IT, C) are ^-dimensional C-vector spaces. 

Exercise: Show that there is an isomorphism of C-vector spaces 
given by 


fiwWW) - S 2 (N), f(z)dz - f(z). 


(Hint: if f(z) is such a cusp form and o 


a b 
c d 


e r 0 (iv), 


then f(az)d(az) = ( cz + d) 2 f(z)(cz + d)~ 2 dz = f{z)dz. The 
holomorphicity of f(z)dz corresponds to being a cusp form 
since 2iridz = dq/q.) 

This shows that the dimension of S 2 (N) is the genus of Xq(N ) 
(in particular finite), computable by Riemann-Roch (explicitly 
given in [19]). Likewise, the finite dimensionality of S 2 (N,e) is 
bounded by the genus of X\(N). There are similar interpreta- 
tions of Sk(N ) for higher k. 

Let Ci, ..., C 2g denote the usual 2 g cycles on the ^-handled X, 
generating free abelian group Hi(X. Z). Let A = Im(iLi(X, Z) — 
hom(fT, C)) where the map is C i— > (cu i— > fc a;), a discrete sub- 
group of rank 2 g called the period lattice of X. The Jacobian of 
X. Jac(X), is the ^-dimensional complex toms C 5 /A, isomor- 
phic as a group to (R/Z) 2 A This is an example of an abelian 
variety over C. 


The Abel map is X — » Jac(X),x i— > {f£ o a>j} (for some hxed 
Xq whose choice doesn’t matter since the image is defined up 
to a period in A). In the case g = 1, then this is bijective. 
Let Div(X) be the free abelian group on the points of X and 
Div°(X ) its subgroup consisting of the elements whose coef- 
ficients sum to 0. The Abel map extends to a group homo- 
morphism Div(X) — > Jac(X), whose restriction to Div°(X ) is 
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surjective with kernel the so-called principal divisors T(X). 

Since Dw°(Xq(N)) /V{Xq{N)) makes sense over Q, this de- 
fines Jo (AO := Jac(Xo(AO) over Q (in fact it is a coarse moduli 
scheme over Z [1 /N] if N > 3). Galois action on its division 
points then produces Galois representations G q — > GL 2 g (Z/n). 
From this we obtain 2-dimensional Galois representations as- 
sociated to certain modular forms, defined below. 

Definition 4.24 If f is a cusp form of weight k, level N , and 
Nebentypus e, define the mth Hecke operator by 

T m f=m^- 1 Zf IN*, 

3 

where if N = 1, the olj run through A n , and for general N 
through the matrices in A n with gcd{a , N ) = 1. 

Exercise: If / has (/-expansion at oo, a nQ n , fhen T m f has 
(/-expansion 

OO 

E where b n = £ e(d)d k ~ l a mn/cP 

n = 0 d\(m,n) 

(taking e(d) = 0 if (d, N) -=f 1). 

Show that T m f is again a cusp form of weight k, level N, 
and Nebentypus e such that T m is a linear operator on Sk, e ( AT). 
Furthermore, check the Hecke operators commute. 

Definition 4.25 IfT m f = A m f (some \ m G C) for all m, then 
f is called a cuspidal eigenform. Actually, a m = X m ai and we 
shall normalize eigenforms so that a\ = 1 and so \ m = a m . The 
commutative ring the Hecke operators generate, T, is called the 
Hecke algebra. 

For example, since Si 2 ,i(l) has dimension 1, basis A, A is 
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a cuspidal eigenform. A very useful corollary to the definition 
follows: 

Corollary 4.26 If f = T.a n q n is a cuspidal eigenform, the 
map T m i— > a m extends to a ring homomorphism ( eigencharac - 
ter) 9 : T C. 

Consider the (injective) map S k (N ) — > C [[<?]] that sends a 
cusp form to its Fourier expansion at oo. Then the inverse image 
of Z[[q}\ is S k {N; Z) and S k (N; A) = S k {N ; Z) 0z A. Note that 
using the explicit coefficients of T m , T acts on S k (N ; Z). The 
^-expansion principle says that S k (N; C) = S k (N), i.e. S k (N) 
has a basis in S k (N ; Z), and so T embeds in End S k (N; Z). This 
has various consequences: 

Theorem 4.27 T is a finite free Z-algebra. If f is a cuspidal 
eigenform, then there exists an algebraic number ring Of con- 
taining all coefficients of f and the image of e. ( Note that the 
image of 6 above lies in Of.) 

If A is a ring, we let S k (N ; A) denote the cusp forms of level 
N and weight k defined over A. Note that 

S k (N ] A) = hom A _ alg (T,A). (f) 

This follows for A = Z by mapping f : S k (N ; Z) — > hom(T, Z) 
by / >— > (t t— > a\{tff) and then noting that <f is injective and 
that T is free of rank < that of S k (N ; Z) . The general case 
follows by tensoring with A. 

Note that the Hecke operators T m also act on Dw°(Xq(N)) 
(and so Jq(N)) by extending linearly T m [z] = E [cqz], where [z] 
is the orbit of z (E H and runs (again) through all matrices 

j with ad = n, d > 0, gcd(a, N) = 1,0 < b < d. Let I p 


fab 
y 0 cl 
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denote the image of inertia under G q p — > G q, D p the image of 
G q p , and Fr p the element of D p /I p mapping to the Frobenius 
map in Gf ■ For A a prime ideal of Of, denote the fraction held 
of (Of)\ by K\. 

Theorem 4.28 Let f = E a n q n be a cuspidal eigenform of 
weight k, level N, and Nebentypus e. For each prime X, there ex- 
ists a unique semisimple continuous homomorphism p : G q — > 
GL 2 (K\) such that if prime p f£N , then p(I p ) = { 1} , t vp(Fr p ) = 
a p ,detp(Fr p ) = e(p)p k ^ 1 . 

Proof: We explain the case k = 2 and trivial Nebentypus, since 
the elliptic curve representations we care about will correspond 
to this case. 

Let £ be the rational prime below A. Let Jo(N)[£ n ] = (Zf£ n ) 29 
denote the kernel of multiplication by £ n on Jq(N) and T^(Jq(N)) 
lim Jq(N) \£ n ] = Z 2g the corresponding Tate module, a free Z t 
rnodule of rank 2 g on which G q acts (the argument for elliptic 
curves carries over to ^-dimensional tori), where g is the genus 
of Xq(N). The action of T m on Jo(N) given above carries over to 
Ti(Jq(N)). Then W := Tt(Jo(N))®z e Qe is a T<S>zQ,e~ module, in 
fact free of rank 2. (The proof of this comes from the Hodge de- 
composition S 2 (N)@S 2 (N) = H l (X, C), where S 2 (N) gives the 
anti-holomorphic differentials, and the fact that if A is a held of 
characteristic 0, then Sk{N ; A) is free of rank 1 over T <S>zA by 
(f) above.) This yields a representation G q — > GL 2 (T <S>z Qa)- 
Note that the particular level N eigenform has not been used 
yet. 

The eigencharacter T — > Of now induces a homomorphism 
T Q ( ^ Of ®z Since Of is an order in a number held, 
Of<g>zQi — ll\ K\. Mapping onto the Ath component yields p : 
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G q — > GL 2 (I<x). We shall see that p has the desired properties 
in the next chapter. QED 

Exercise: Show that if O is the ring of integers in a number held, 
then the splitting of l in O determines the form of O Gz Q^. 
(Hint: Z[x\/(f(x)) Oz Qf- = Qe[x]/(f(x)).) 

For cases with k > 2, Deligne [7] used higher symmetric pow- 
ers of the Tate module. For k = 1, Deligne and Serre [8] built 
the representation from congruent eigenforms of higher weight. 

Exercise: Let p : G q — » GL n (K ) be a continuous homomor- 
phism where K is a local held with valuation ring O. Consider- 
ing I\ n thus as a G^-module, show that there is a stable lattice 
O n under this action. 

Letting O be the valuation ring of K\ and F its residue held, 
the residual representation associated to a cuspidal eigenform 
/ will denote the composition Gq — ^ GL 2 (0 ) — > GL 2 (F) pro- 
duced above. 

The Big Picture. We have three sources of naturally oc- 
curring Galois representations, namely elliptic curves, modu- 
lar forms, and etale group schemes. The most general is the 
last, and these will be used next to define the general class of 
semistable representations we shall focus on. We shall see that 
this class contains the representations coming from an elliptic 
curve associated by Frey to a putative counterexample to Fer- 
mat’s Last Theorem. The first thing is to create links between 
the various kinds of representations dehned. 
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Invariants of Galois representations, 
semistable representations 


Let F be a finite field of characteristic i > 2 and V a 2- 
dimensional F- vector space on which G q acts continuously. 
Thus we have a representation p : G q — >• GL- 2 {F). By compos- 
ing p with the homomorphism G — > G q, V has a Gc^-action 
and so defines an etale group scheme over Q^. 

Definition 5.1 Call p good at i if this group scheme comes 
via base change from a finite flat group scheme over Z £ and 
also det p\i e = x\h> the cyclotomic character restricted to the 
inertia subgroup fl. Call p ordinary at t if p\j e can be written 

( "V n \ j > 1 < \ 

q ^ I . Call p semistable at t if it is good or ordinary at 
l. 

Let prime p L Call p good at p if p is unramified at p, i.e. 
p(I p ) = {1}. Call p semistable at p if p(I p ) is unipotent (i.e. all 
its eigenvalues are 1). Call p semistable if it is semistable at all 
primes. 
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5.1 Serre Invariants 


Given p as above, we want to define positive integers N(p), k(p), 
and a group homomorphism e(p) : ( Z/N(p)) x — > C x . Suppose 
p is odd, i.e. detp(c) = — 1, where c is complex conjugation, 
and that p is absolutely irreducible, i.e. V <S>f F has no G q- 
invariant proper subspace. We shall consider the following two 
conjectures of Serre [30]: 

Conjecture 5.2 ( Strong conjecture ) There exists a cuspidal 
eigenform f of weight k{p), level N(p), and Nebentypus e(p) 
whose associated residual representation is p. 

Conjecture 5.3 (Weak conjecture) There exists a cuspidal eigen- 
form whose associated residual representation is p. 

Work of Kenneth Ribet and others [24], [10] established that 
the weak conjecture implies the strong conjecture. N(p) comes 
from the local representations p p for p £ and k(p) from pg. 
We begin with the Artin conductor N(p). 

Consider p p : Gq p — > GL' 2 {F) (p ^ £). Let Q t denote the 
(finite) image of the ith ramification subgroup (using upper 
numbering, in fact, [29]; note that Qq = Go and Q\ = G i). Let 
Vi denote the subspace of V fixed by Q % , and set 


DO 


n(p, P) = E 


dim {V/Vf) 


i=0 \Go/Gi\ 

It is a deep theorem [29] that n(p, p) is a non-negative integer. 
More easily, note that 


( i)n(p,p ) = 0 <=> V = Vq p p unramified, (p(Go) = {1}) 


( ii)n(p,p ) = dim(K/Vo) V = V\ <==> p p tamely ramified, (p(Gi) = {1}) 
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Definition 5.4 The Artin conductor of p is defined by 

n(p) = n p nM , 

well-defined by (i) above, since only finitely many primes ramify 
under p. 

Since detp : G q — > F x has abelian image, it factors through 
Gal(L/ Q) for some abelian extension L/Q. By Kronecker- Weber, 
there is a smallest m such that L C Q(£ TO ), and one can 
show that m = £N(p). Thus, det gives a character (Z/m) x = 
(Z/T) x x ( Z/N(p)) x — ■» F x . Restricting this to the second fac- 
tor yields homomorphism e : ( Z/N(p)) x — > F x r ~^ F x . Lifting 
this to Z C C gives e : ( Z/N(p)) x — > C x . This defines e(p). 

We would like to have detp = e{p)x k ^~ l ■, and this determines 
k(p) (mod £— 1 ). By the result at the end of chapter 3, p{Q\) = 
{1}. Thus p\i f : It — > GL- 2 {F) has cyclic image of order prime 
to £, and so is diagonalizable. Let 0, : fi — > F x be the two 

characters this produces. The exact description of kfip) is quite 
complicated but just depends on what 0, are in terms of 
fundamental characters (see chapter 3) [30]. 

Theorem 5.5 p is semistable if and only if N(p) is squarefree, 
e(p) is trivial, and k{p) = 2 or £ + 1. 

Proof: The form of e{p) and of k(p) (mod £ — 1) corresponds 
to det p being exactly the cyclotomic character. The exact form 
of k(p) requires Serre’s prescription above. As for N(p) being 
squarefree, n(p, p) = 1 if and only if Vq is of dimension 1 and 
Vj = V , so if and only if the representation is ordinary at p. 
QED 
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5.2 Fontaine-Mazur Conjecture 

The notions of “good” and “semistable” extend to represen- 
tations to rings other than fields. Suppose R is a complete, 
Noetherian local ring with residue held finite of characteris- 
tic I. Then R is a Z^-algebra and we can view the cyclotomic 
character as mapping to R via x '• Gq — > Z^ — > R x . Let 
p : G q — > GL, 2 (R) be a continuous homomorphism, giving a 
CQ-action on V := R 2 . 

Definition 5.6 If |i?.| < oc ; then call p good at I if the corre- 
sponding etale group scheme over comes from a finite flat 
group scheme over Z<? and det p\p = x\h- For general R, call p 
good at i if for every (closed) ideal I of finite index in R, the 
representation p : G q — > GLflRfl ) is good att. Call p ordinary 
at i if there is a short exact sequence 0 — > V — > V — ■> V° — > 0 
of free R-modules stable under Gq £ , such that R acts trivially 
on C° and as x on V _1 so that 

n \ ~ ( X\ it * \ 

Pll ‘ ~ [ o i ) • 

Call p semistable at t if p is good or ordinary at i and det p\i e = 

X\i r 

The following is a special case of the major conjecture of 
Fontaine and Mazur [12] that says that every potentially semistable 
1-adic Galois representation arises from the Galois action on 
some subquotient of an Cadic cohomology group of a variety 
over Q. 

Conjecture 5.7 (Fontaine-Mazur) If O is the valuation ring 
of a finite extension K of Q g, and p : G q — > GL-2{0 ) a contin- 
uous, absolutely irreducible homomorphism, unramified at all 
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but finitely many primes, and semistable at £, then there exists 
a cuspidal eigenform f whose associated Galois representation 
is p. 

Wiles’ work amounts to establishing some special, nontrivial 
cases of this conjecture. 

5.3 Reduction of elliptic curves 

Our goals now are to introduce Frey’s elliptic curves associated 
to a putative counterexample to Fermat’s Last Theorem, to 
define semistable elliptic curves and show that Frey’s curves 
are semistable, and to show that the Galois representations 
attached to semistable elliptic curves are semistable. First, we 
need a little on reduction of elliptic curves. 

A Weierstrass model for an elliptic curve E over Q is an 
equation of the form 

y 2 + a\xy + a%y = x 3 + a 2 X 2 + a^x + oq. 

By completing square and cube it can be written y 2 = x 3 — 
27 c4X — 54c 6, where c\ = b 2 — 24&4,C6 = —b\ + 36&2^4 — 21666 
and &2 = ci\ + 4o. 2, = 2 aq + a\a^, b$ = a\ + 4a6- This curve 

has discriminant A = (c| — Cg)/1728. 

Definition 5.8 We say that E has good reduction over Q p (or 
at p) if there is a Weierstrass equation for E with cq G Z p that 
(when the coefficients are reduced modulo p) defines an elliptic 
curve over F p ( which happens if and only if the corresponding 
A fL pZ p ). 

We have to be careful - for example, y 2 = x 3 — l and y 2 = x :i — 
64 define the same elliptic curve over Q but with the reductions 
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mod 2 being nonsingular and having a cusp. Another example 
is the Frey elliptic curve with equation y 2 = x(x — A)(x + B), 
where A, B : C := —A — B are nonzero relatively prime integers. 
We shall focus particularly on the case A = a , B = b , C = c , 
with t > 5 a prime and gcd(a, b, c) = 1 (i.e. a counterexample to 
( FLT)i ). For this equation, A = 1 6(ABC) 2 and so if p /2 ABC, 
then E has good reduction at p. For the (FLT)e situation, 
however, we may assume, by relabeling, that A = — 1 (mod 4) 
and B = 0 (mod 32) and then changing variables by x i— > 
4x, y i— > 8y + 4x yields equation 

2 o B - A - 1 2 AB 

y + xy = x H x x. 

J J 4 16 

For this equation, A = 2 ~ 8 (ABC) 2 . Moreover, C4 = A 2 -j- AB + 
B 2 , which is checked to be relatively prime to abc , i.e. v p {c^) = 
0 for all primes p of bad reduction. This is what is called a 
minimal Weierstrass model for the curve, in that | A | is now as 
small as possible, say A m i n . 

Exercise: Considering how changes of variable of the form x 1— > 
rt 2 x, y 1— > u 3 y affect A, C4, Cq, show that a Weierstrass model 
is minimal if v p (A) < 12 or v p (c 4) < 4 or v p (cq ) < 6 for ev- 
ery prime p. (Here v p denotes p-adic valuation.) Show that 
changes of variable do not change the A invariant of the equa- 
tion. Deduce that two elliptic curves are isomorphic over an 
algebraically closed held if and only if they have the same j- 
invariant. 

So, for our example, v p (c^) = 0 gives that the model is mini- 
mal. Thus, ifp|a5c, the Frey curve necessarily has bad reduction 
at p. In fact, its reduction mod p has a node. 

Definition 5.9 Suppose p is a prime of bad reduction for E . E 
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mod p has a unique singular point. If that point is a node ( re- 
spectively cusp), then we say E has multiplicative (respectively 
additive) reduction atp. A node ( respectively cusp) means that 
two (respectively three) of the roots of the cubic are equal. 

Say E has semistable reduction at p if it has good or multi- 
plicative reduction atp. Call E semistable if its reduction at all 
primes is semistable. 

Multiplicative reduction occurs at p if and only if u p (c 4 ) = 
o, «p(A) > 0. (We see this directly for the Frey curves for odd 
primes p since if p divides a, it does not divide b and so only 
two roots of x(x — (f) (x + b ( ) coalesce modulo p .) Thus, Frey 
elliptic curves are semistable. 

What is particularly nice about semistable elliptic curves is 
that their associated Galois representations are semistable. This 
is established via the theory of Tate curves: 

5.3.1 Tate elliptic curves 

Consider first the complex case, where A is the lattice in C 
with basis 1, t(e EC). There is a group isomorphism C/Z — > C x 
given by x i— > e 2mx . Setting q = e 2mT , this yields a group iso- 
morphism C/A — > C x /q z , where q z is the infinite cyclic group 
with generator q. We thus have a multiplicative representation 

of E A (C). 

Explicitly, in terms of q, setting j^x = X + ^ and j^y = 

2Y + X, the equation for E\ becomes: 

y 2 +xy = x 3 +a 4 (g)+a 6 (g),a 4 (g) = -5 s s (q),a 6 (q) = -(5s 3 (g)+7s 5 (g))/12, 

and Sk(q) = Note that a^q) , a^q) E Z [[</]], so this 

defines an elliptic curve over Z [[q]\ since the discriminant of 
this curve is q n^ = i(l — q n ) 2A and we get its /-invariant as ^ + 
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744 + 196884g + ...(*), as usual. Tate observed that much of 
this carries over from C to p-adic local fields: 

Theorem 5.10 (Tate) Let K/Q p be a finite extension with val- 
uation v and absolute value \u\ = c v ^ for some 0 < c < 1. 
Suppose q G K x satisfies \q\ < 1. Then Sk(q), afiq), afiq) are 
defined as above and converge in K . Then y 2 +xy = x 3 + afiq) + 
afiq) defines an elliptic curve E q over K and for any algebraic 
extension L/K, L x /q z = E q {L). 

Since \q\ < 1, the reduction of the curve is y 2 + xy = x 3 , 
which has bad multiplicative reduction (in fact split - meaning 
that the tangent lines at the node are defined over the residue 
held). 

Conversely, if E has multiplicative reduction at p, then its 
j -invariant has v p (j ) < 0 and the series q = j~ l + 744j -2 + 
750420 j~ 3 + ..., inverting (*), converges to give q such that 
v p {q ) = — v p {j ) > 0, so \q\ < 1. The corresponding Tate elliptic 
curve E q is isomorphic over Q p (in fact over a quadratic ex- 
tension of Q p ) to E since they have the same j -invariant. The 
division points of E q are particularly simple to study, whence 
the action of Gq p on them is easily given. 

Theorem 5.11 Let E have multiplicative reduction at p. Then 
there exists q G such that E(Q p ) = (Q p /q z )(S) as Gq p - 
modules, where 6 is the unique unramified quadratic character, 
i.e. (jlyfifix)) = 0(u(x 5 ^)) for all a G Gq ( t , where S : Gq p — > 
{±1} is trivial if the reduction is split, and is the unique un- 
ramified quadratic character otherwise. Thus, 

Peioo\ Gqp = ^ 
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Proof: E q [£ n } C E q { Q p ) and if x G Q p/q z , x r is trivial if and 
only if x r = q m for some m G Z, so if and only if x = Qn(q 1 ^ n ) b 
for some a, b G Z/£ n . Consider the map 

E q [£ n ] ->< q > / < q r > ^ Z/f\ x ^ x e \ 

a homomorphism with kernel / 1 \(n. G q p acts via the cyclotomic 
character on the kernel and trivial on the image since q G Q p . 
QED 

Theorem 5.12 If E is a semistable elliptic curve over Q, then 
Pe/°° '■ G q — > GL 2 (Zf) is semistable. 

Proof: First, the Weil pairing gives that A 2 T^{E) = pe oo 5 cLS 
Galois modules, and so detp is the cyclotomic character. 

Second, if E has good reduction at £ 1 then for each n E[£ n ] 
comes from a finite flat group scheme over Zp namely the min- 
imal Weierstrass model. The representing Z^-algebra 9? is free, 
since tensoring with and gives the same rank and the 
only such finitely generated Z^-modules are free. Then pE,e°° is 
good at £. 

If E has bad so multiplicative reduction at £, then the previous 
theorem gives that the Gadic representation is ordinary (note 
5\i e is trivial). 

Third, if E has good reduction at p £ ) then by Neron-Ogg- 
Shafarevich below E[£ n ] is unramified at p, so pE,e°° is good 
at p. If E has bad so multiplicative reduction at p / f, the 
previous theorem again applies. QED 

Theorem 5.13 (Neron-Ogg-Shafarevich) Suppose p frn . Then 
E has good reduction at p if and only if the GQ-action on E[m\ 
is unramified at p. 
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Proof: We show the forwards direction, which is the one we 
need. Let K be a finite extension of Q p such that E[m\ C E(K). 
Let F be the residue held of K. One checks that the kernel of 
reduction <f : E(K ) — > E(F ) has no points of order prime to p. 
Thus the restriction E[m\ — > E(F) is injective. Let P G E[m\ 
with reduction P, cr G I p . Then <f>(cr(P) — P) = a (P) — P = 0, 
and so by injectivity <r(P) = P. Thus, I p acts trivially on E[m\. 
QED 

This result is likewise true for abelian varieties, in particular 
Jo (AO, and thus shows that the representations at the end of 
chapter 4 are unramified at primes not dividing £N. Moreover, 
in both cases, the action on Fr p on the m-division points is 
given by their action on their reductions. 

Theorem 5.14 Suppose E, an elliptic curve over Q, has mul- 
tiplicative reduction atp > 2, which may or may not be £. Then 
p E ,e is good atp if and only if £ \v p (A min ). 

Proof: Consider the ^-division held of E q , namely K = Q P (Q, q 
Up ^ £, note that £\v p (A m i n ) £\v p (q) K / Q p imramihcd 

p^yunramihed at p. For p = £, need the argument in Edix- 
hoven’s article in [ ]. QED 

For the Frey curves related to (PLT)^, A rnin = 2 ~ 8 (abc)' 2( and 
so £ divides v p (A m i n ) for all odd primes p. By the last criterion 
PE,e is good at every prime except possibly 2. Being good at £ 
gives k(pEj) = 2, whereas being good at the other odd primes 
and semistable at 2 gives N(pE.e) = 2. Since its determinant is 
just the cyclotomic with no twist, e(pEp) is trivial. Thus, if ps,e 
were associated to a cuspidal eigenform, then it would come 
from 62(2) = {0}, a contradiction. 

Moreover, Mazur’s work [ ] shows that pE,£ is absolutely ir- 
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reducible and so a counterexample to (FLT)p would violate 
Serre’s strong conjecture. The idea for this is that if say pe / 
were reducible, then E[£\ would have a subgroup or quotient of 
order £ on which G q acts trivially. Such a thing corresponds 
to a rational point on XfiT), but Mazur found all of them and 
there are no noncuspidal ones for large i. 

Ribet [24], which inspired Wiles to begin his long journey to 
the proof, took the approach of trying to prove that Serre’s 
weak conjecture implies his strong conjecture. This then shows 
that a counterexample to ( FLT\ would violate pE,e being as- 
sociated to any kind of cuspidal eigenform, i.e. the Shimura- 
Taniyama conjecture. Ribet, Diamond, and others established 
the full “weak implies strong” conclusion later [ ], but Ribet at 
first did enough for our purposes. 

Theorem 5.15 (Ribet) Suppose p : G q — > GL-j^F) is an odd, 
continuous, absolutely irreducible representation over a finite 
field F of characteristic i > 2. Suppose p is associated to some 
cuspidal eigenform of level N, weight 2, and trivial Nebentypus. 
If p is good atp\\N , then p is associated to some cuspidal eigen- 
form of level N/p whenever p 1 (mod £) or gcd{I , N) = 1. 

A description of Ribet ’s approach will be given in an ap- 
pendix. 

Corollary 5.16 The Shimura-Taniyama conjecture, that every 
Galois representation attached to an elliptic curve is attached 
to a modular form, implies Fermat's Last Theorem. 

Suppose we have a counterexample to ( FLTfi,£ > 5. If the 
Shimura-Taniyama conjecture holds, then the associated Frey 
curve is associated to some cuspidal eigenform of squarefree 
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level N. It is a theorem of Mazur that says that the corre- 
sponding ^-division representation is absolutely irreducible. By 
repeated use of Ribet’s theorem, this representation is associ- 
ated to some cuspidal eigenform of level 2 (weight 2 and trivial 
Nebentypus), but there are no such. 

The Big Picture. The proof of Fermat’s Last Theorem is 
hereby reduced to proving that all semistable elliptic curves are 
modular. This will be achieved by showing that certain families 
of semistable Galois representations are all associated to mod- 
ular forms. We therefore need some way of parametrizing such 
families and this will be provided by universal (semistable/modular) 
Galois representations. 
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A good description of this material can be found in [14]. The 
original source is [20]. 

Fix a prohnite group G, a finite held F, and a continuous 
representation p : G — > GL n (F). Let Cp denote the category 
whose objects are complete, Noetherian local (commutative) 
rings R with residue held R/ yyir = F and whose morphisms 
are ring homomorphisms <f> : R — > S such that (/)(iyir) C m # 
and such that the induced map R/uir — > is the identity 

map on F. Cohen’s theorem [2] says that every ring in is 
a quotient ring of W(F)[[Ti, ..., T m ]\ for some m and so is a 
W (F)-algebra. 

Definition 6.1 A lift of p to a ring R in Cp is a continuous 
homomorphism p : G — » GL n (R ) which modulo m# produces 
p. Two such lifts pi and p% are called strictly equivalent if there 
exists M e T n (R) := ker(GL n (R ) — > GL n (F )) such that p 2 = 
M~ l p\M . A deformation of p is a strict equivalence class of 
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lifts. 

Let E(R) = {deformations of p to R). A morphism R — > 
S induces a map of sets E{R) — > E(S), making E a functor 

Cp — > Sets. We establish conditions under which E is 

representable, so that there is one representation parametrizing 
all lifts of p. 

Say that G satishes (f) if the maximal elementary Gabelian 
quotient of every open subgroup of G is finite. Examples in- 
clude: 

(!) G Qp for any p\ 

(2) let S be a finite set of primes of Q and Gq,s denote the 
Galois group of the maximal extension of Q unramified outside 
S, i.e. the quotient of G q by the normal subgroup generated 
by the inertia subgroups I p for p 0 S. 

These satisfy (f) by class field theory. Note that by Neron- 
Ogg-Shafarevich a Galois representations associated to an el- 
liptic curves or a modular form factors through G q s for some 
S. 

Theorem 6.2 (Mazur) Suppose that p is absolutely irreducible 
and that G satisfies (f). Then the associated functor E is rep- 
resentable. 

That means that there is a ring 3ft (p) in Cp and a continuous 
homomorphism £ : G — ■» GL n {fit(p)) such that all lifts of p to 
R , in CV arise, up to strict equivalence, via a unique morphism 
3ft (p) — > R. The representing ring R(p) is called the universal 
deformation ring of p and (the strict equivalence class of) £ the 
universal deformation of p. For example, if n = 2, p is odd, 
F = 7i/Z , and G = Gq,s with £ e S, then 3ft (p) is typically 
Z t [[T u T 2 ,T 3 }}. 
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One proof of Mazur’s theorem follows Schlessinger’s criteria 
[27], which were developed to study local singularities. We let 
C° F denote the subcategory of consisting of Artinian rings. 
Note that if R is in Cp, then every R/m l R is in C° F . The dual 
numbers F[e\ = F[T]/(T' 2 ) with e the image of T. A morphism 
R — > S is called small if it is surjective with kernel a principal 
ideal whose product with m# is 0, for example the map n : 
F[e\ — > F given by a + be i— > a. 

Definition 6.3 Suppose E : Cp > Sets is a functor sat- 

isfying \E(F)\ = 1. Let R\ — > Rq and R 2 — > Rq be morphisms 
in C F . Consider the natural map 

(*) E(R\ x Rq R 2 ) — > E(R\) x e( R q) E{Rf) : 

which exists since E(R\) x E ( R q ^ E(R 2 ) is the terminal object in 
the category of sets 



E(R r) E(R 2 ) 



E(Rq) 


Schlessinger’s criteria are as follows: 

HI. R 2 —> Rq small implies (*) surjective. 

H2. If Rq = F, R 2 = F[e\, and R 2 — > Rq is the map 7 r above, 
then (*) is bijective. 

Note: If H2 holds, then tp := E(F[e\) has an F -vector space 
structure (the tangent space of E). 

H3. tp is a finite- dimensional F -vector space. 

H4. If Ri = R 2 and Rj — > R,q(i = 1,2) is the same small 
map, then (*) is bijective. 
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Theorem 6.4 (Schlessinger) HI, H2, H3, H4 hold if and 
only if E is representable. 

Proof: So to prove Mazur’s result, we need to show that his 
functor E satisfies HI, H2, H3, H4. Let £ t denote the set of 
continuous homomorphisms G — > GL n {Rf) lifting p. Let K t = 
T n (Ri). Then E{Rf) = £*/£*. Let R% = R\ x Ro R 2 . We are 
interested in the map 

(*) E3/K3 — > E1/K1 x Eq /k 0 E2/K2, 

when R 2 — > Rq is small. 

To show (*) is surjective then, we take p\ £ E\ , p2 £ £2, 
yielding the same element of Eq/Kq , i.e. pi = M~ l p2M for 
some M £ Kq. Since R 2 — > Rq is surjective, so is I\ 2 — > Ko- 
If N £ K 2 maps to M, then {pi : N^ l p 2 N) gives the desired 
element of £3. 

Exercise: (*) is injective if C K 2 (p 2 (G)) — > Ck 0 (po(G )) is surjec- 
tive. 

If p is absolutely irreducible, then both these groups consist 
of scalar matrices and so this holds, ensuring H4 holds. This 
actually follows from Nakayanra’s Lemma, since e.g. R.2[G] — > 
EndR 2 {R 2 ) is surjective after tensoring with £ thanks to the 
absolute irreducibility of p. If Rq = £, then Kq = {1} and so 
the condition holds, whence H2 follows. 

As for H3, note that T n (£[e]) is isomorphic to the direct 
product of n 2 copies of £ + , so is an elementary abelian t-group. 
Thus any lift p : G — > GL n {F[e\) of p factors through G/H , 
where ker(p)/H is the maximal elementary abelian quotient of 
ker(p). By (f), G/H is finite and so there are finitely many lifts 
to £[e], proving H3. QED 
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We shall be interested in families of representations satisfying 
some further condition, such as semistability. For this we need 
Ramakrishna’s refinement of Mazur’s result. 

Let X be a property of W (F) [Gj-modules of finite cardinality 
which is closed under isomorphism, direct sums, taking sub- 
modules, and quotienting. Fix p : G — > GL n (F ) such that F n 
considered as a W (F) [Gj-module via p satisfies X. For R e C° F 
(which must then have finite cardinality), let Ex(R ) denote the 
set of deformations in E(R) satisfying X. 

Theorem 6.5 ( Ramakrishna ) Ex is a functor on C° F . More- 
over, if E satisfies HI, H2, H3, H4, then so does E\ ( in which 
case both functors are representable, where Ex is extended to 
the category C F by Ex(R) = lim E\ ( R/ m ? ff ) ) . 

Proof: Let R , S be objects in C ° F and <f : R — > S a morphism. To 
show Ex a functor, we need to show that if p : G — » GL n (R ) 
has X, then the composition map G GL n (S ) also has X. 
This follows since if B = R n and D = S n both with the given 
G-action, then <f induces B — > D making D a finitely generated 
(it’s finite!) F-module, say a quotient of B m - since having X 
is closed under direct product and quotient, B and so B m and 
so D all have X. 

The next thing to note is that HI for Ex implies H2,H3,H4 
too. This follows since restrictions of injective maps to subsets 
are still injective. This gives injectivity in H2 and H4 with HI 
giving surjectivity. As for H3, since Ex(F[e]) C E(F[e\), the 
tangent space of Ex is also finite-dimensional. 

To prove HI for Ex, set R 3 = R\ x Ro R 2 . Let p\ x po p 2 G 
Ex(Ri) X E X (R 0 ) Ex(R2)- By HI for E , we get p G E(R 3) map- 
ping to this element. We just need to show that p has X. Well, 
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Rs Ri x R 2 induces iftf R™ x Rlj, making iftf a submod- 
ule of a direct product of W(F) [G]-modules with X, whence 
i ?3 with this G- action has X. QED 

Theorem 6.6 Suppose that E and Ex satisfy the hypotheses 
of the previous theorem. Let 3ft and 3ftx be the respective defor- 
mation rings. Then there is a natural surjection 3ft — > 3ftx- 

Proof: Let £ : G — > GL n (3ft) and £x : G — > GL n (3ftx) denote the 
universal deformations. Since £x is a lift of p and £ parametrizes 
all such, there is a (unique) morphism <f : R — > iftx which after 
composition with £ yields £x- Let the image of <f be S. We 
thereby get a representation p : G — > GL n {S), which is of type 
X. By universality of 3ft x we get a unique morphism 3ftx — > S 
producing p. The composition 3ft x — > 5 3ftx, by universality 

again, has to be the identity map, so S = 3ftx- QED 

We shall show that semistability defines such a property X. 
Namely, fix a finite held F of characteristic £ > 2 and a con- 
tinuous homomorphism p : G q — > GL 2 (F) which is absolutely 
irreducible and semistable. (Note that semistability includes 
that det p be the cyclotomic character and this makes p odd, 
so we need not make this an extra condition.) Fix a finite set 
E of rational primes. Let R be in C ° F . If p : Gq — >• GL 2 (R ) is a 
lift of p, we say that p is of type E if 

(1) det p is the cyclotomic character; 

(2) p is semistable at l\ 

(3) if £ £ E and p is good at £, then p is good at £\ 

if p E U {£} and p is unramified at p, then p is unramified 
at p; 

if p E U {£} and p is ramihed (so ordinary) at p, then p is 
ordinary at p. 
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Note: (1) If E is a semistable elliptic curve over Q and Pej: is 
absolutely irreducible, then pE,e°° is a lift of type £ if £ contains 
all the primes of bad reduction for E. 

(2) If p is of type £ C £', then p is of type £'. 

(3) If p is of type £, then p is unramified outside {p : p\£ N(p)}U 

£. 

(4) A lift p of p, unramified outside £, with det(p) the cyclo- 
tomic character, semistable at £, is of type £ U {£). 

Theorem 6.7 Given a continuous absolutely irreducible homo- 
morphism p : G q — > GL, 2 (F) and given £, there exists an ob- 
ject in Cf and a continuous homomorphism ps : Gq — > 
drifts) such that every lift p of p to R in C f, of type £, 
is strictly equivalent to the composition of ps with some f, a 
unique morphism from to R. 

The proof proceeds by dealing with each of the conditions re- 
quired for type £ in turn. First, we deal with the group scheme 
condition. To apply Ramakrishna’s criterion we consider the 
etale group scheme over corresponding to Gq ( — > GL n (R ) 
with RiriCf and say that p has property X if this group scheme 
extends to a finite flat group scheme over Z<?. We just need to 
show that X is preserved under direct sums, sub, and quotient. 

Definition 6.8 If G is an affine group scheme over Z^, say 
represented by A, let G gen be its fibre (or base change ) over 
so represented by A <S>z e Qig Lot H gen be a closed subgroup 
scheme of G gen , say represented by ( A <S> z £ Q t)/J where J is 
a Hopf ideal ( ideals that ensure the quotient is still a Hopf al- 
gebra). Let I = 0 -1 (J) under <f : A — > A <S>z e Q^- Then the 
group scheme H represented by A/ 1, a subgroup scheme of G, 
is called the schematic closure of H gen . 
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Lemma 6.9 A/ 1 is torsion-free. 

Proof: f induces an injection A/ 1 (. A <g>z e Q e)/J, which is 

torsion-free since it is a Q^- algebra. QED 
It follows that H is a finite flat group scheme over Z^. This 
is why goodness is preserved under sub. As for quotient group 
schemes, if H and G are group schemes over R with H(A) 
a normal subgroup of G(A) for all /?- algebras A, then typi- 
cally F(A) = G(A)/H(A) does not give a representable func- 
tor F on A-algebras. Raynaud showed how to fix this. Let 
A' = {a G A\fi(a) = 1 0 a (mod / 0 A))}. Then A' is con- 
sidered as representing G/H . Finally, as regards direct sums, if 
g. H are finite flat group schemes over represented by R, S 
respectively, let F(A) = G(A) x H(A) for every Z^- algebra A. 
Then check that F is an affine group scheme represented by 
R 0 z e F, which is a finite flat Z/-algcbra. 

The second kind of condition is of the following nature. Let 
G be a profinite group and / a closed subgroup. Call p : G — » 
GLi 2 (R) /-ordinary if the hxed points of R 2 under / form a free 
direct summand of rank 1. 

Theorem 6.10 If p is I -ordinary and absolutely irreducible 
and G satisfies (f) ; then it has a universal I -ordinary lift. 

Proof: Again, we need only prove HI. Let Ej(R ) denote the set 
of /-ordinary deformations of p to R. Let R% = R\ x# 0 R.^ and 
consider pi x Po P 2 C Ei(Ri) x e^Rq) E^Rf). Then since Mazur’s 
functor E is representable, we get a p^ £ E(Rf) mapping to 
pi x po p 2 . It remains to show that p^ is /-ordinary. QED 
The third kind of condition is just that p have determinant 
the cyclotomic character. Without imposing this condition, we 
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so far have a universal deformation £ : G q — > GL 2 ^R) satisfying 
everything else contained in type E. In particular, for p S, 
£ is unramified at p. Thus, £(Fr p ) (for p 0 5) are defined. Let 
det^(Fr p ) = r p G 31 and let / be the ideal of 3? generated by 
r p — p for p ^ S. The representation £ = £ (mod /) : G q — > 
GL 2 ^St/I) now has det^(Fr p ) = r p = p = x(Er p ) for p ^ S. 
By Chebotarev’s density theorem, the Fr p {p S ) are dense in 
Gq,s and so def£ = y. 

Incidentally, imposing all these conditions has actually re- 
duced us to a ring 3ft// which will turn out to be finite over 

Z/. 

Finally, we need another description of the tangent space of 
our functors. Suppose E is a representable functor on Cp, rep- 
resented by 3?. Set tp = E(F[e]), the tangent space of E, a 
finite-dimensional F-vector space. This dimension will turn out 
to be a useful invariant of both E and 3£, calculated via Galois 
cohomology. 

Note that tp = hom(3ft, F[e]). Under every such morphism, 
msft maps to {be\b G F} with kernel containing m| + /3ft, such 
that tp is the dual space of m^/ (m| + /3ft). For example, if 3ft = 
W(F)[[Ti, ...,T r ]], then = (Ti, ..., T r , t) and so dim ptp = 
dim^(mgfj/(m| + /3ft)) = r. Furthermore, if I is an ideal of 3ft 
such that I C m| + /3ft, then the tangent space of 3 ft// is also 
r-dimensional. 

The Big Picture. We have seen that given a semistable p 
and a finite set of primes S, there is a universal lift £s : G q — > 
GL2(3ftx) which parametrizes all lifts of p that are semistable 
at / and that have no worse ramification at p E than p has. A 
very rough idea of how big 3ft^ is, can be given by the dimension 
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of its tangent space. We shall shortly describe a universal lift 
of p of type S parametrizing those lifts associated to modular 
forms and describe a useful invariant for measuring how large 
that representing ring. Some commutative algebra will then es- 
tablish how an inequality between these two invariants suffices 
for showing that the two universal lifts (type S and modular 
of type S) coincide. First, we need to describe how to compute 
the tangential dimension of in terms of Galois cohomology. 
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A good introductory text for this chapter is [35]. For more 
advanced material, see [21]. 

Fix p : G — > GL n {F). Consider the possible lifts cr of p to 
GL n (F[e]). In particular, there is the trivial lift, which we shall 
also denote p, arising from the embedding F — > F[e\. Then 
a(g) = (1 + ea{g))p{g) for some map a : G — » M n {F). The 
fact that o is a group homomorphism, i.e. cr(gh ) = a (g)a(h), 
translates into a(gh) = a(g) + p{g)^ 1 a{h)p{g). 

Definition 7.1 Let M be a G -module ( where the actions are 
continuous if G is pro finite). A 1- cocycle is a (continuous) map 
f : G — >• M such that f(gh ) = f(g) + gf(h ) for all g,h G G. 
A 1-coboundary is a (continuous) map f : G — >• M given by 
f(g ) = gx — x for some (fixed) x £ M . These are 1- cocycles 
and we shall set H 1 (G,M) to denote the quotient group. 

Exercise: Check the 1-cocycles and 1-coboundaries do indeed 
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form abelian groups, the first containing the second. Show that 
if G acts trivially on M, then H 1 (G : M ) = Hom(G : M ). 

Note that GL n (F ) acts on M n (F ) by conjugation (the adjoint 
action). Thus, the map p makes M n (F ) into a G- module, to be 
denoted Ad (p). As noted in the first paragraph above, every 
lift of p to F[e] produces a 1-cocycle. One checks that if two 
lifts are strictly equivalent, then their 1-cocycles differ by a 
1-coboundary. This yields a map from E(F[e\) (deformations 
to the dual numbers) to F[ l (G, Ad(p)). Conversely, given a 1- 
cocycle a : G — » Ad(p), defining a(g) = (1 + ea{g))p{g ) gives a 
lift of p to F[e\. In this way, we get a bijection 

E(F [e])*H\G,Ad(p))- 

Note that this gives an alternative way of seeing that E(F[e]) 
is an F-vector space. If we impose some property X and look 
at the deformations of p that have X, i.e. Ex(F[e]), this cor- 
responds to some subgroup of i/ 1 (G, Ad(p)) which we denote 
Hx(G, Ad(p)). In particular, if we start with a semistable, ab- 
solutely irreducible representation p : G q — > GL 2 (F) (F of odd 
characteristic £) and consider lifts of type S, then the subgroup 
will be denoted H^(Gq, Ad(p)). 

We want to identify this set by considering what sort of 1- 
cocycles correspond to the properties involved in being of type 
S. 

First, we impose the condition that the lift o should have 
determinant the cyclotomic character y, which is the same as 
the determinant of p. Then det(l + ea(g)) = 1 for all g £ G q. 
Since 

/ 1 T ean ea\2 

y £^21 1 + €022 


1 + ea(g ) 


5 
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and e 2 = 0, this gives l+e(an+a 22 ) = 1, implying trace(a(g)) = 
0. The GQ-submodule of Ad (p) consisting of trace 0 matrices 
will be denoted Ad°(p). We are therefore only interested in 
subgroups of H 1 {Gq, Ad°(p)). 

Second, if A/ is a subgroup of G and M a G-module, then we 
can restrict 1-cocyles and 1-coboundaries from G to H , thus 
producing a restriction map 

res : H l (G, M) H\H, M ). 

More generally, any homomorphism H — > G will produce such 
a restriction map. The local behavior corresponding to type £ 
will be captured by the restrictions from G q to G q p and/or 
I p . For example, suppose a lift a corresponds to an element 
of the kernel from H x {Gq, Ad°(p)) — > H 1 (I p , Ad° (p)) . Then 
a(g) = p(g ) for all g £ / p , so that a is unramified at p if and 
only if p is unramified at p. Being in the kernel in fact ensures 
that the ramification at p of a is no worse than that of p. 

Definition 7.2 Let M = Ad°(p). By local conditions we mean 
a collection C = {L p }, where for each prime p (including in- 
finity) we are given a subgroup L p < H 1 (Gq v , M) such that for 
all but finitely many p L p = ker(H l (GQ i , M) — > H 1 (I p: M)) . 
(These are called the unramified classes and will be denoted 
H l ur {G^M).) The corresponding Selmer group will be 

HKGq, M) = {ce H l (G Q , M) : res p (c) e L p for all p}, 
where res v : H 1 (Gq, M) -> H 1 (Gq , M) is restriction. 

Note that by Qoo, we mean the real numbers, and so Gq x = 
Gal( C/R) of order 2, identihed as the subgroup of order 2 of 
G q generated by complex conjugation. Since F has odd char- 
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acteristic, = {0} by the following, and so has 

to be {0}. 

Example: Suppose G has order 2 and M odd order. Show that 
H\G : M) = {0}. 

Proof: Let G = { 1 , c} . A 1-cocycle is a map / : G — > M 
such that f(gh ) = f(g ) + gf(h ) for g,h G G. Setting g = 1 
gives /( 1) = 0. Setting g = h = c gives 0 = /(c) + c/(c). So 
c G := {m G M|cm = — m}. Conversely, if m G M“, 
defining / by /(l) = 0, /(c) = m gives a 1-cocycle. 

If / is a 1-coboundary, then f(g) = gx — x for some x G M. In 
particular, /(c) = cx — x. Consider the map / : M — > M~ given 
by /(x) = cx — x. Let M + = {m G M|cm = m}. Since \M\ 
is odd, M = M + ® M~(m = rr ^f R + 7? ^). Then |/m(/)| = 
|M |/|/\er(0)| = |M|/|M + | = \M~\ and so / is surjective. QED 

Exercise: If G is infinite (pro)cyclic, generated say by g and 
M is finite, show that H l {G,M) = M/{fg — 1 )M). [Hint: if 
m G M, define / : G — > M by f(g l ) = m + + g 2 m + 

... + g l ~ l m. Show that / is continuous, i.e. there exists N such 
that if i = j (mod N), then f(g l ) = f(g j ), and satisfies the 
1-cocycle condition. Conversely, if / is a 1-cocycle, use the 1- 
cocycle condition to obtain the form of f(g l ) by induction on 

<•] 

Theorem 7.3 HUGq. Ad(p)) = H' C IG Q . M), where M = Ad °(p) 
and £ is given by: 

Too 0; 

L P = Hl(G Qp , M) ifp? SU{(}, 

= H 1 (G Qp ,M) ifpeY,,pjU, 

= H}(G Qp ,M) i}p = l<tY,, 

= Hl(G Qp ,M) i/p = (eE. 
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Here, Hj and H] s are the flat and semistable cohomology 
groups respectively, to be defined below. 

This theorem is simply a restatement of what it means for a 
lift to be of type £ - all we do is translate the conditions across 
to cohomology. Our main aim will be to compute the size of 
this Selmer group, giving the tangential dimension of Ifts- First, 
since the property of p being good or ordinary is defined in 
terms of the CrQ-module F 2 , we need another interpretation of 
H 1 . 

Definition 7.4 Given p : G — » GL n {F), consider V = F n , a 
G-module thanks to p, and consider the set of extensions E of 
V by V , consisting of short exact sequences 

0 _> V E V -> 0 

of F[G]-modules. Call two such extensions equivalent if there 
is an isomorphism i : E\ — > £2 making the following diagram 
commute 


0 — - r — - /•:, — -0 


lv 

i 

i 

' > 


0 — — - E 2 — — -0 
Let Extp^(V,V) denote the set of equivalence classes. 

Theorem 7.5 There is a bisection between H l (G, Ad(p)) and 
Ext l m (V,V). 


Proof: Pick <f : V — >■ E such that = m for all m G V. 

Given g £ G, dehne T g : V — > V by 


m 1 — > a l (g4>(g — 
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T g can be considered as an n by n matrix. One checks that 
T g k = T g + gTh, where gT \ means the matrix Tj x conjugated by 
p(g). One also checks that equivalent extensions correspond to 
1-cocycles that differ by a 1-coboundary. QED 
Recall that an A [GqJ - module V of finite cardinality is good 
if there is a finite, flat group scheme H over Zf such that V = 
H( Qe) as FfOgJ-modules. Recall that V is called ordinary if 
there is an exact sequence 

0 -> V^ 1 -» V -> R° 0 

of F[GQ e \ -modules such that A acts trivially on R° and via 
the cyclotomic character on R -1 . V is called semistable if it is 
good or ordinary (or both). 

Definition 7.6 Identify H 1 (GQ n Ad(p)) with Extp^^V^V). 
Let HI s (Gq^ Ad(p)) consist of those extensions ofV by V that 
are semistable. If p is not good at £, take Hj(GQ e , Ad(p)) to 
be Hi. If p is good at £, take Hj(GQ a Ad(p)) to consist of 
those extensions ofV by V that are good. Hl(G q^, Ad°(p)) and 
H\ {Gq £ , Ad° (p)) are defined by intersecting the above subgroups 
with H l (GQ e ,Ad°(p)). 

Theorem 7.7 H^(Gq, Ad°(p)) is a finite group. 

Note: we actually already know this theorem since its dimen- 
sion over F is the tangential dimension of but it is good 
preparation for finding the exact order of the Selmer group.. 

Proof: Let M = Ad°(p). Let A be a finite set of primes con- 
taining all the “bad” ones, i.e. oo, £, the primes p such that I p 
acts nontrivially on M, and such that L p ^ H\ r . By definition 
of Selmer group, there is an exact sequence 
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o -> H&G^M) - H\G q ,s,M) -> ©pcs-ff'iGq,, M)/L p . 

Let H = ker{G^s — > Aut(M )), the kernel of the action of 
Gq ,5 on M. Since M is finite, H is an open, finite index, nor- 
mal subgroup of Gq ; 5 . By an above exercise, since H acts triv- 
ially on M, = Hom(H, M). This is a finite group 

since the Hermite- Minkowski theorem says that there are only 
finitely many extensions of degree l of the fixed held of H (a 
finite extension of Q) unramified outside the primes above S. 

The inflation-restriction exact sequence (see below) says that 

0 ^ H 1 (G q ,s/H,M h ) ^ H 1 (GQ t s, M) -> 

is exact, where M H := {m G M\gm = mfor allg G H}. Since 
Gq^s/H and M H are hnite, so is H 1 {Gq,s/ H. M H ) and so is 
as noted above. Thus, so is H l (Gq^s,M) and so is 
its subgroup H^(Gq, M). QED 

The next theorem identifies the kernel of the restriction maps 
introduced above. 

Theorem 7.8 (Inflation- Restriction) If M is a G-module and 
H is a normal subgroup of G, then M H is a G/H-module and 
there is an exact sequence: 

0 — H\G/H, M h ) -V H\G, M ) h\H, M). 

Corollary 7.9 Hl(G % ,M) = H^G^J I p , M 1 -) 

The advantage of working with cohomology groups is that 
they fit into various exact sequences and perfect pairings, al- 
lowing them to be computed in terms of simpler objects. One 
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such simpler object is which is defined to be M G . 

An example of this sort of simplification is: 

Theorem 7.10 For finite M, \Hl(G Qp ,M)\ = \H°(G Qp ,M)\ < 
oo. 

Proof: Consider 

0 _> M Gq p -> M Ip M Ip -> M Ip /{{Fr p - 1 )M Ip ) -> 0 
and by the last exercise, 

H\G Qr /I p ,M y = ff‘(< Fr p >,M I ») = M I 0((Fr p -l)M I <‘). 

QED 

One can define 2-cocycles and 2-coboundaries to be certain 
maps / : G x G — > M. Namely, a 2-cocycle / is a map that 
satisfies 

gf{h , &) - f{gh , &) + f(g , hk ) - f(g , h) = 0, 

whereas a 2-coboundary is an / of the form 

f(g,h)=gF(h)-F(gh)+F(g) 

for some F : G — > M. Then take i7 2 (G, M ) = {2— cocycles}/{2— 
coboundaries}, and in fact we can likewise define r-cocycles and 
r-coboundaries. 

Theorem 7.11 Suppose O^-A^B^-C^Oisa short ex- 
act sequence of G -modules. Then there is a long exact sequence 

0 tf°(G, A) -► hT°(G, 5) hT°(G, G) -» 

H\G,A) /^(G, U) tf^G) -» 

H 2 (G, A) CT 2 (G, 5) 77 2 (G, G) . . . 
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This sequence continues with groups H l (G, A) defined for all 
integers i > 0. 

Before giving Wiles’ big result on sizes of Selrner groups, we 
need some preparatory results. Note that if A and B are G- 
modules, then hom(A, B) has a G-action given by ( g(f))(a ) = 
g(f(g~ l a)) where g G G, f G hom(T, B),a E A. 

Theorem 7.12 (Local Tate duality) Suppose M is a Gq i -module 
of finite cardinality, n. Set M* = hom(M, /i n (Q p )), where /i n ( Q p ) 
is the nth roots of 1 in Q p given its natural GQ p -action. 

There is a nondegenerate pairing for i = 0, 1, 2 

H'(G Qt ,M) x H 2 -'(G Qr ,M’) H 2 (g Qr ,Vn) ^ Q/Z. 

If p does not divide the order of M , then under the pairing 
Hfi. ( Gq p , M) and H) Lr (GQ v , M*) are the exact annihilators of 
each other. 

Theorem 7.13 (Wiles) Let C = {L p } be local conditions and 
= n;}, where L* C H 1 (Gq p , M*) is the annihilator of L p 
under Tate’s pairing. By the last remark, these are also local 
conditions. 

\Hl(G Q ,M)\ _ \H°(Gq, M)\ \L p \ 

\W c .(G q , M-) | \H a (Gq, M*)\ p hoc I W(G Qp , M ) | 

Note that since for all but finitely many primes L p = H) ir (G q p , M), 
which has the same order as H°(G q p , M), the righthandside is 
a finite product. 

Proof: Here’s the idea. The Poitou-Tate 9-term sequence (see 
e.g. Milne’s book) and the definition of the Selrner group yield 
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exact sequence (S being the set of bad primes introduced earlier 
and c complex conjugation): 


0 ^ H°(Gq s, M) -> (e p€S H 0 (G Qp ,M))/((l+c)M) -> ff 2 (G Q , s , Af ) A 


Hi(G Q, M) ^ - H\G Q,s, M*) A ^ ff^Gq, Af*) A ^ 0 

Here H A denote the dual hom(H, Q/Z). 

The alternating product of orders of groups in an exact se- 
quence is 1, and so 

Igagopni |g 0 (G Q ,s,M)||g 2 (G Q ,s,Ar)||(l + c)Af|n pe s|I, p 
| Hi, (G„, M>) I 1 (G q , Si M*) I n pes |tf° (Gg p , M) I 

Since we are happiest computing orders of H ih s (or if neces- 
sary H 1 ’’ s), we need some way of removing the H 2 term here, 
which is provided by the global Euler characteristic formula : 




| AT 


= l(l+c)M|. 


\H°(G QtS ,M*)\\W(G QtS ,M*)\ |T°(G Qoo ,M*)| 

QED 

Note that this last module is what was called M + in a previous 
example. There is also a local Euler characteristic formula : 

Theorem 7.14 If M is a finite G^ p -module, then 

\H\G q .M)\ 


\H°(Gq , M)\\H 2 (Gq , M)\ 


= P 


Vp( \ M \) 
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This will be needed in studying how H ^ varies as we vary 
S. Suppose, for instance, we add a prime into S, yielding S'. 
Then maps onto We can control the difference in their 
tangential dimensions as follows. 

Theorem 7.15 Suppose \M\ is a power of £ and q=f £ a prime 
for which L q = Hb(GQ q , M). Define CJ by L' p = L p if p q, 
and L' q = H 1 (Gq q , M). ( This corresponds to S' = S U {q}.) 
Then 


Hb(G Q ,M)\ 

W C (G Q ,M)\ 


< |ff°(G Q „M*)|. 


Proof: If C is replaced by consider what happens to the 
terms on the righthandside of Wiles’ formula. They remain the 
same except for the term for p = q, which changes from 1 to 
\ H \ G Q q , M)\/\H { \ G Q q , M)\ = \H 2 (G Qq ,M)\ , using the above 
Euler characteristic formula. By the local Tate duality pairing, 

We must also consider the effect on C*. Let C* denote the 
new dual local conditions. Since L'f = 0, the conditions defin- 
ing Hb* are more restrictive than those defining | Hb- Thus, 
M>)\ < \H}..(G Q , M*)\. 

Putting this together, 


HpG Q ,M)\ 1 

Hc”{Gq, M“) 

Hl(G Q ,M)\ 

w c ,(g q ,m>) 1 


H°(G Q .An\<\H°(G Qq ,&n\. 


QED 

The remaining matter is to compute the individual terms on 
the righthandside of Wiles’ formula. Let us start with p = oo. 
H\G^M) = M G b oo. We are assuming p is odd so can take 
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gation by that matrix, easily computed to be { 


and so with 


\a e 


1 0 
0 -1 

M = Ad u (p), we seek those trace 0 matrices fixed under conju- 

a 0 
0 —a 

F}. | Lqo | = 1 and so the p = oc term contributes l/|Lj. 

Suppose p ^ t. If p 0 S, we already noted that \L p \ = 

\H°(Gq , M)\ and so the ratio in the formula is 1 for such 
primes. 

llpj= l and pe S, then \L p \/\H<>(G Qr , M)\ = |iC(G Ql „ M)\/\H%G Qp , M)\ 
I H 2 (Gq p , M) | (by local Euler characteristic formula) = \H°(Gq p , M*) \ 

(by local Tate duality). 

Thus, the only hard part lies in computing the p = £ con- 
tribution to the formula. (Note that the “global” terms, e.g. 

\H°(Gq, M) | will typically be 1, since we assume that p is abso- 
lutely irreducible, whence the centralizer of M consists of scalar 
matrices, but the only such of trace 0 is the zero matrix.) 

In the case of Flj, the theory of Fontaine and Lafaille is used, 
whereby they work with a category equivalent to that of fi- 
nite flat VE(F)[(TQj-modules, namely whose objects are W(F)~ 
modules D of finite cardinality together with a distinguished 
submodule D° and VE(F) -linear maps 0_i : D — > D and (j ) o : 

D — » D satisfying 0 _i|do = and Im((j)- 1 ) + Im((j) o) = D. 

This reduces the computations to linear algebra, and it turns 
out that 

\Lt\ 


Qr Ad°(M 


= in 


As for Hj s , if we let W < Ad°(p) = Mbe{^|j * J then 
this is a GQ^-submodule since p restricted to G q £ has the form 
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( Y'l/i -j * \ 

0 ^ j . Here the ordinariness of 

p at i translates into ^ 1,^2 being unramified characters (i.e. 
trivial on /<?)• Since 

( r s \ / 0 a \ f r s \ 1 _ / 0 ra/t\ 

\0 t ) \ 0 0 ) \ 0 t ) = \0 0 )’ 

the Gc^-action on W is via the character (t)- 

If we want to consider lifts to F[e\ that are ordinary at i too, 
then we need to set 


L, := ker(H 1 (G Ql ,M) -> H\l e , M/W)). 

Consider now 


H\G q „M) 

0 — Hl(G Q „ M/W) — H\G q „ M/W) — H\I e , M/W) 

Then L[ = ker(res o 9). 

The short exact sequence of Gc^-modules 

0 -> W -> M -> Af/IT -> 0 


yields a long exact sequence of cohomology groups: 


0 - H°(G q „ W) - l/ 0 (G Q; , M) -> H°(G q „ M/W) - 

H\Gq„W) -> H\G q „M) -> Im(9) -* 0 . 

Since the alternating product of the orders is 1, 

lf = 1 g° (go, ) [ I (Gq„ M/W) \\HHG Qt ,M)\ 

1 1 jl \H°(G Qr M)\mG Qr W)\ 
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Next note that \Im(res o 9)\ > \Im{9)\/\H} ir {GQ n M/W)\ = 

\Im(e)\/\H°(G Qt ,M/W)\. 

Putting this together, 

\L f \ = \H 1 (G Ql ,M)\/\Im(reso0)\ < |ff 1 (G Q( ,M)||ff o (G Q „M/lV)|/|/m(0)| 
= \H t >(G Qt ,M)\\H 1 (G Qt ,W)\/\H l >(G Qn W)\. 

Thus, 


I LA \H l (G Qn W)\ 

\H»(G q „M)\ ~ \H»(G qi ,W)\ 


H 2 (G q „W)\\F\ = \H°(G Ql ,W')\\F\ 


(using respectively the local Euler characteristic, noting Uf(|VE|) = 
1, and local Tate duality). 

To compute \H 0 (Gq v W*)\, let q = (^(Frij/^Fri)) - 1, 
which will turn out to be very important later! Let 0 G W* 
be bxed by Gq £ , i.e. (f){gr ) = gcj)(r ) for all g G G^ n r G F. If 
a = ^i(Fr^)/^ 2 (Er£), then by (f), (j){ar ) = 0(r) (y(^) cancels), 
i.e. 0(Qr) = 0, and so 0 factors through F/(qF). (This looks 
a little strange but later on we use the same calculation with p 
replaced by p : G q — > GL2(Z /£ n ) and so F is replaced by R and 
F/(c,F) by R/(c,R).) In any case, |ff 0 (G Q „ W‘)\ = \F/(c t F)\ 
and so 


\L t \ 

\H°(G q „M)\ 


< \F\\F / (c(F)\. 


The Big Picture. We obtained a formula, involving the or- 
ders of various Galois cohomology groups, for the tangential 
dimension r of (be. the smallest r such that is a quotient 
of W(F)[[Ti , . . . , T r ]]). We also computed how this dimension 
changes under the operation of adding (or removing) a prime 
to/from S. We next intend to show that there is a universal 
lift of type E parametrizing lifts associated to modular forms, 
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G q — > and then show that the map 7^ this 

produces is an isomorphism of local rings. First, however, let us 
obtain a criterion for establishing such an isomorphism, a nu- 
merical criterion involving on the one hand r and on the other 
a certain invariant of 7s. 



8 

Criteria for ring isomorphisms 


cvi 


Given the kind of p : G q — > GL 2 (F ) in which we are in- 
terested (i.e. absolutely irreducible and semistable) and a fi- 
nite set E of rational primes, we have obtained a universal lift 
£ : G q — > (77/2 (^s) of type E and we shall soon obtain a 
universal “modular” lift £' : G q — > GL- 2 (T s) of type E. By uni- 
versality, we get a morphism 0£ : — > 7s, which will be seen 

to be surjective. We want a numerical criterion that will suffice 
to show 0s is an isomorphism. 

By kind of p, we also wish to include that it is associated to 
at least one cuspidal eigenform /. This means that / gives us 
a lift of p to characteristic zero, say pj : G q — > GT^O), where 
O is the valuation ring of a finite extension of and is in Cp. 
We shall see later that there exists / such that pj is of type 
0, so of type £ for any E. The universality of and 7^ then 
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yields a commutative triangle of morphisms: 



O 


Let Co denote the subcategory of Cp consisting of local O- 
algebras (as noted earlier, every ring A in Cp is of the form 
W(F)[[Ti , ..., T r ]\/I and so is a local W (F)-algcbra - we are ask- 
ing that the map W ( F ) — > A factors through O, or equivalently 
that A is a quotient of some O [ [T\ , ...,T r ]]). Inspired by our in- 
tended application, we shall consider the category Cq, whose 
objects are pairs (A,tia), where A is in Co and 7Ta ■ A — >• O is 
a surjective morphism, and whose morphisms are morphisms 
<f : A — » B such that 



O 


commutes. 

Definition 8.1 Associated to an object (A,tta) of Cf, are two 
invariants, namely the cotangent space = (ker(7i a)) / (ker(7i a)) 2 
(a finitely generated O -module) and the congruence ideal tja = 
'KA{AnnAker{'KA)) (an ideal of O). 

Call a ring A in Co a complete intersection ring if A is free of 
finite rank as an O -module and is expressible as 0[\T \, ..., T r ]\/(fi , ..., / r ). 

Example: Let / be the cuspidal eigenform associated to the 
elliptic curve denoted 57B in Cremona’s book. This has level 
57, weight 2, trivial Nebentypus, and all its coefficients in Z. 

It therefore has an associated 3-adic representation pj : G q — > 
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GL2(Z 3 ). Let p : G q — > GLfifLf^f) be the mod 3 representation 
it produces. One computes that p is surjective as follows. 

The image of Fr 2 must have trace 02 (f) = 1 and determinant 
2. The only such elements of GL2(Z/3) have order 8. Next, the 
image of the local representation at 19 is given by Tate’s the- 
ory of elliptic curves since 19 is a prime of good multiplicative 
reduction and we see that 3 divides its order. Thus 24 divides 
the order of the image of p. One easily sees that the only sub- 
group of GL-2{7i/S) of order 24 is 6X2 (Z/3) but the image of 
Fr 2 has nontrivial determinant. Thus the image is the whole 
of GX2(Z/3) of order 48. 

It follows that p is absolutely irreducible, Since 577> is a 
semistable curve, p is semistable. Our theory will apply to this 
example. 

The theory of the next chapter allows us to compute that 

T% = {(x,y) e Z\\x = y (mod 3)} = Z 3 [[T]]/(T(T - 3)). 

Thus, 7g is a complete intersection ring, <f>7- 0 — Z/3, r\r, = (3), 
and so |$rj = |Z 3 /r/r 0 |. 

Amazingly, these sorts of properties turn out to hold in gen- 
eral, and even more strikingly there is a simple numerical cri- 
terion that is sufficient to establish the kind of isomorphism we 
covet. 

Theorem 8.2 (Wiles, improved by Lenstra) Let (j) : R — T 
be a surjective morphism in Cq. Assume that T is free of finite 
rank as an O -module and that t]t (0) (so \0 /t)t\ is finite). 
Then the following are equivalent: 

(a) |4>i?| < \0/r) T \, 

(b) |$ r | = \0/p T \, 

(c) the map <f is an isomorphism of complete intersection 
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rings. 

The theorem is established via the following lemma: 

Lemma 8.3 Suppose that A and B are in Cq and that there is 
a surjection <f : A — ^ B . 

(1) 4> induces a surjection <j> : V and so |d>^| > \&b\- 

hA C r] B . 

(2) I'M > \0/va \ • 

(3) Suppose B is a complete intersection ring. If <j> is an iso- 
morphism and is finite, then <j> is an isomorphism. 

(4) Suppose A is a complete intersection ring. If ija = ijB 
(0) and A and B are free, finite rank O -modules, then <f is an 
isomorphism. 

(5) Suppose A is free and of finite rank as an O -module. Then 

there exists a complete intersection ring A mapping onto A such 
that the induced map is an isomorphism. 

Proof: We show how the lemma implies that (a) implies (c). 
That (c) implies (b) will come out of the proof of the lemma 
later. That (b) implies (a) is trivial. 

\0/tjt\ > \&r\ > | d > r | > \0/tjt\ (*), 

by what we are given, together with (1) and (2). Thus \0/ijt\ = 
\<&t\, whence 

PVI = VI = VI ^ \°/nf\i 

by (5) and (2). But since by (1) pp C p T , it must be that 
It = V fi which by (4) implies that the map T — >• T is an 
isomorphism and so T is a complete intersection ring. 
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Another consequence of (*) is that |<f>#| = \&t\, and so by 
(3), (f) : R ! — > T is an isomorphism. QED 
Now we prove the lemma. For this we need first to know 
Nakayama’s lemma and what Fitting ideals are. 

Lemma 8.4 (Nakayama’s Lemma) Let A be a local ring and 
M a finitely generated A-module. If I is an ideal of A such that 
IM = M , then M = 0. 

Proof: Let mi, ..., m n be a minimal generating set for M . Since 
m n G M = IM , we can write it rimi + ... + r n m n with r ? ; G I . 
Then (1 — a n )m n = rimi + ... + r n _im n _i. We cannot have 
both a n and 1 — a n in the maximal ideal of A and so 1 — a n 
is a unit, but then m n is in the submodule of M generated by 
a contradiction. QED 

Corollary 8.5 Suppose ifj : M — > N is a homomorphism of 
finitely generated A-modules and I an ideal such that Mf IM — > 
N/IN is surjective. Then if is surjective. 

Proof: Apply Nakayama’s lemma to the cokernel. QED 
Note that M/ IM = M<S>aA/I. In our context we shall apply 
the corollary by establishing that a map of A-modules is sur- 
jective after tensoring with O or F (which via tta are quotients 
of A). 

Definition 8.6 Let R be in Co and M be a finitely generated 
R-module. This means that there is an exact sequence 

0 -> M' R n -> M -> 0 

of R-modules (i.e. a presentation of M). 

The Fitting ideal of M , Fittji(M), is defined to be the ideal 
of R generated by det{v i, ..., v n ) as the V{ G R n run through M' . 
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Exercise: (i) Show that this ideal is independent of the choice 
of presentation. [Hint: from two presentations R m — » M — > 0, 
R n — » M — > 0, form a presentation R m+n — > M — > 0.] 

(ii) Suppose R = O and so without loss of generality 

M = O r © (0/ A ni ) © ... © (0/A nfc ), 

where A is a uniformizer of O. Show that Fitto(M) = (0) if 
r > 0 and = (A ni +- n *) if r = 0. 

Deduce that |M| = \0/Fitto{M)\. 

Proof: (1) The composition ker(7T A ) — her {it b) — ^ factors 

through $ A . The mere existence of a map Ann A ker (7r a) 
Annsker^s) implies that r) A Q ©b- 
(2) We note that FiUr{M) C Annn(M). This follows because 
if we take a presentation 

0 -> M' -> R n -> M -> 0 

and let xi,...,x n £ M be the images of the standard generators 
of R n and let d = det{cif) where (c,y ) = (vi, . . . , v n ) with each 
Vi £ M\ then J2j CjjXj = 0 and so dl n = (dq )(©,-) (where (d tJ ) is 
the adjoint-transpose of (c^)) multiplies the vector with entries 
Xj to 0, whence dxi = 0 for 1 < i < n, and so since the X{ 
generate M, d £ Ann^fM). 

Secondly, note that 7r A (Fitt A (M)) = Fitto(M ©^ O) (where 
7£4 makes O into an ©-algebra) , since as an O-module M ® A 
O = M/(ker(7r A )M ) is defined by the same relations as those 
defining M as an ©-module). 

In particular, ker(ir A ) ©a O = and so 

Fitt 0 (® A ) = 7r A (FiU A (ker(7r A )) C n A (Ann A ker(7r A )) = r) A , 

whence |<f>n| = |0 /F^o(^>a)| © | O/^l, the first equality com- 
ing out of the exercise above. 
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[Note that the “(a) <G=> (b)” part of the main theorem 

follows since if R surjects onto T, then always |$r| > \&t\ > 

\0/r] T \- } 

(3) Consider U = 0[[Ti, T r ]\ as in C* Q by taking i t b to be the 
map sending each T- L i— > 0. Since B is a complete intersection 
ring, there exists a local homomorphism v B : U — > B with 
kernel (/i, ...,/ r ). Letting 6* = v B (Ti), then bi G ker( 7 r B ). Since 
0 : <f>n — > Tg is an isomorphism, there exists a t G /cer(7Tn) 
mapping to and the images of cq in <f>^ generate <f>n- 

Now define z/4 : U — » A by sending R 1— > a*. This gives a 
surjection z/4 : and so by Nakayama z/4 is surjective. 

We next establish that ker{y B ) C ker(vA ) (and so are actually 
equal) . 

Since Qjj = O r and <f>n is finite, ker(vA ) has r generators, 
say g u ... , g r . Pick g u . . . , g r G ker(v A ) mapping to hi, . . . , g r . 
Since ker( v A ) Q ker{y B ), (hi,. ..,&•) = for some 

M G M r (U). Let M = M (mod (Ti, ..., T r )), i.e. the ma- 
trix of constant terms. Then (hi, . . . , g r ) = (/1, . . . , g r )M. Since 
these generate the same rank r submodule of finite index in 
Tp, det(M) G O x . Thus M is invertible and we are done. 

Finally, this means that vav b 1 is well-defined. It is an inverse 
to (f) and so 0 is an isomorphism. 

(4) We first claim that ker{itA ) d Ann Aker (tt a) = 0. This is 
proven as follows. Suppose x G ri a , % 7^ 0. Say x = 7174(2/), 
where x' G Ann Aker (zr a)- If a G ker{yA ) Cl Ann Aker (71 a), then 
since x — x' G fcer(7ru), 0 = a(x — x') = ax (since a G ker{iiA) 
and x' G Ann Aker ( tt a))- Thus a is O-torsion, so a = 0. 

Thus the restriction ha '■ AnnAker{iTA) V A is injective. The 

definition of 77^4 makes it surjective. Since tja = g b, this isomor- 
phism translates into the restriction of (f) : Ann Aker (n a) — > 
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Annsker^B) being an isomorphism. 

So we have an exact sequence 0 — > ker((j)) © Ann Aker {it a) — > 

A, with the cokernel Aj (. ker{4>)®AnnAker{ / KA )) = B / {(j){AnnAker{TT fa)) 
B / {Annskeri^B)) r —> Endoker^s)- Thus the cokernel is tor- 
sionfree, giving us splitting over O. If we now define A A = 
hom o(A,0), then one can see explicitly that A A = A as A- 
modules (this says that complete intersection rings are Goren- 
stein). We therefore get a dual exact sequence: 

A — > (. ker((/))) A © {Ann Aker {tia))^ — > 0. 

Tensoring with F (the residue ring of A) gives 1 = dimp{A®A 
F ) and {Ann Aker {7 Ta)) a< S>aF fa 0 (since t]a fa 0). Thus {ker{<fa)) A ®A 
F = 0. By Nakayama’s lemma and dualizing, ker{q b) = 0, and 
we are done. 

(5) As seen in (2), we can write A as a quotient of U = 

0[[Ti, ..., T r ]], where the T,[ map to elements of ker{iiA)- The 
idea is to choose /i, ..., f r generating the kernel of the induced 
d©(= O r ) — ■» d© and then to lift these to U and set A = 

U/{fi , ..., f r ), a complete intersection ring if we ensure that A 
is finitely generated. 

Let ai,...,a r be O-module generators of ker{iTA)- Let V = 

0[Ti , ..., T r ], Dehne <f> : V — >■ A by T :j i— >■ aj. Then (f) is sur- 
jective. Pick /i,...,/ r G ker{(f)) and let m be the maximal de- 
gree occurring. Since aj G ker{r^A ), aj = hi{ai, ...,a r ) for some 
linear polynomial h t . Replace fa by fa + T™hi — Tfa l+2 . Then 
V/{f\ 1 ..., fa) is finitely generated as an O-module. Complete 
at (A, Ti, T r ) to get A = U/{fa , ..., f r ), a finitely generated 
0-nrodule. Note that A —> A induces an isomorphism on the 
cotangent spaces since the linear terms of the fa generate the 
kernel of the induced map d© — » d©. 
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QED 

There is one problem in applying the Wiles-Lenstra criterion 
to our set-up, namely that 37s and 7^ are certainly W(F)- 
algebras but may or may not be O-modules. We can fix this by 
setting R = <S>w(F) O and T = 7^ <S>w(F) O , and checking 

the following exercise. 

Exercise: 

(a) 37s — * 7s is an isomorphism if and only if the induced 
map R — > T is an isomorphism. 

(b) 7s is a complete intersection ring if and only if T is one. 

(c) R and T are the universal deformation rings (for type E 
and type E modular lifts respectively) if we restrict Mazur’s 
functor to Co- 

With these rings R and T, we now need to interpret and 
t]t with the aim of proving \$>r\ < \0/tjt\- 

Lemma 8.7 Let E be the fraction field ofO. There is a natural 
bijection from 

Hom 0 ($R,E/0) - Hl(G Q ,Ad 0 (pf) 0 E/O). 

Thus, | Phi R \ = \Hl(G Q ,Ad 0 (pf)®E/O)\. 

Proof: Recall how at the start of chapter 7 we showed how 
given p : G — » GL n (F ) a lift a : G — > GL n {F[e\) yielded 
a 1-cocycle G — > ker(GL n (F[e\) — > GL n {F )) by considering 
the difference between g and the trivial lift of p. Likewise, 
here we have a trivial lift of pf : G q — > GL^iO) to R/p 2 , 
where p = ker(R — > O), since R (and so R/p 2 ) is an O- 
algebra. Using this, each lift of pj to R/p 2 gives a 1-cocycle 
Gq — > ker(GL 2 (R/ p 2 ) — >• GL2(0)) = M 2 ($r). Strictly equiv- 
alent lifts differ by a 1-coboundary and the process can be re- 
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versed. 

Next, note that / G Hom(f&R, O/X 1 ) together with this de- 
fines a class in Ad°(p/) 0 0/ A n ), since we are only 

considering lifts of type E. Taking the union (direct limit) 
over all n yields the lemma. Another way of seeing this is to 
consider lifts of pf to 0[e]/(A n e, e 2 ). Every such lift yields a 
map R — > 0[e]/(X n e, e 2 ). This restricts to a homomorphism 
p — > (0/ \ n )e such that p 2 maps to 0, and so this factors 
through <f ) r . QED 

We can now use the Wiles-Lenstra criterion to reduce proving 
that — > 7s is always an isomorphism to the case E = 0. Let 
= S U {p},R' = ® w ( f ) O, V = 7jy 0vr(F) O. We have 

the following commutative diagram, reflecting the facts that 
the larger rings parametrize larger sets: 


R ' — - V 

R *T 

Theorem 8.8 There exists c p G O satisfying 

| < £ ) i?'|/l < ^l < |0/cp|, rjT' Q c p t]t- 

We shall specify c p shortly. The amazing thing is that the 
same c p arises on either side. This is what makes our reduction 
work. 

Corollary 8.9 Suppose R — T is an isomorphism. Then so 
is R! — T'. Applying this one prime at a time, we see that 
if 3?0 -so- 7g is an isomorphism, then ^ '7s is an isomor- 
phism for every finite set E of primes. 

Proof: By the theorem, |$r/| < \&r\\0/c p \, which = \0/t}t\\0/c p 
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by Wiles-Lenstra since R —> T is an isomorphism. Applying 
the last theorem again, this < \0/rjT'\- Applying Wiles-Lenstra 
again gives that R' — > T' is an isomorphism. QED 
The big question is what c p works. If p £, then following 
theorem 7.15 we have that 

IKffl/l-I-fll < \H°(G Qr ,AP)\, 

where M = Ad °(pf) ® E/O. This //' is just the hxed points of 
M* under the action of Gq p and so is readily computed. 

Lemma 8.10 Suppose p ^ t. \H°(Gq p , M*)\ = \0/c p \ where 
c p = (p — l)( a p — (p + l) 2 ) if p is unramified in p (and so pj), 
and = p 2 — 1 otherwise. 

Proof: In the case that p is unramified, I p acts trivially and so 
the action of Gq p factors through < Fr p >. Thus we need the 
order of the submodule of M* hxed under Fr p . The first thing 
to note is that the representation Ad °(p/) = Sym 2 (8) det~ l 
and so we just need the hxed points of Sym 2 . The eigenvalues 
of Fr p are a 2 , a p /3 p: j3 2 as given below and so of 1 — Fr p are 
1 — a 2 , 1 — a p l3 p: 1 — /3 2 . Thus the determinant of the action is 
(1 — a 2 )( 1 — a p /3 p )( 1 — (3 2 ) = c p above. 

The ramified at p case is similar. QED 

Remarks: Note that c p ^ 0. This follows from the Petersson- 
Ramanujan bound \a p \ < 2 y/p. 

This will be our choice of c p for p t. To give some perspec- 
tive of how Wiles’ desired inequality actually hts into the big 
picture of number theory, relating orders of Selrner groups to 
special values of L-functions, we next dehne the L-function of 
the symmetric square of pf. 

Definition 8.11 Suppose f = E a n q n is a cuspidal eigenform 
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of weight 2 and trivial Nebentypus. Let a p , /3 P be the eigenvalues 
of Frobenius, so they satisfy a p -\-j3 p = %,(/) and a p /3 p = p. The 
pth Euler factor of L(s,pf) is 1/((1 — a p p~ s ){l — (3 p p~ s )) = 
1/(1 — a v p~ s + pp~ 2s ). The pth Euler facor of Lis , Sym 2 ipf)) 

«!/(( l 

The amazing fact is that up to a power of p, c p is the pth 
Euler factor of L(2, Sym 2 (pf)). In this way we can restate the 
desired inequality in terms of the order of a Selmer group being 
bounded by a special value of an L-function, and we have a case 
of the Bloch-Kato conjecture, a vast generalization of the Birch 
and Swinnerton-Dyer conjecture. Unfortunately, this observa- 
tion led Wiles off on a wild goose chase for several years, seeking 
to prove this case of Bloch-Kato by generalizing ideas of Flach 
to construct geometric Euler systems, similar to those that had 
been successful in proving cases of the Birch and Swinnerton- 
Dyer conjecture earlier. We shall return to this observation once 
we have introduced T^. 

Lemma 8.12 If p = l, then if p is not both good and ordinary, 
then \Hl s \/\Hj\ = 1 and so we set q = 1. If p is both good and 
ordinary, then we set q = a 2 — (£+ l) 2 . Up to units, this is the 
same as a 2 — 1, where cq is the unit root of X 2 — a?X + 1 ( there 
is one such by the ordinariness of p at I. 

Froof: The factor in Wiles’ formula for the order of the Selmer 
group goes up by \Hl s \/\Hj\ in going from £ to If p is not 
good, then in definition 7.6 Hj was taken to be H] s . If p is not 
ordinary, then its lifts are not either and so a lift is semistable 
if and only if it is good. 

In the good, ordinary case, we already calculated that, if M = 
M\p f ) 0 O/A", then \H}(G Qn M)\ = \H 0 (G Ql ,M)\\O/X n \, 
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whereas \HI s (Gq v M)\ < \H°(Gq v M)\\0 /\ n \\0 /(\ n ,cf)\, where 
C£ = (ipi /ip 2 )(Fre) — 1. Since = iff 1 and ^ 2 {Frf) is the unit 
root of X 2 — agX + £, we get q = aj — 1. QED 
Once we describe the construction of 7s and hence compute 
r/y, we shall be able to show the other half - that r/y/ C c p 7/y. 
This then reduces us to having to prove the isomorphism in the 
case of S = 0. For this we prove the following. In applications, 
R n and T n will be rings obtained by using sets S containing 
primes of a specific form, for which we can control the structure 
of and 7s . The proof of this result, while considerably easier 

than the earlier proposed approach via “Flach systems” , is still 
quite involved and forms the content of the companion paper 
by Taylor and Wiles [34] . It is where the gap that remained for 
16 months, is fixed. 

Theorem 8.13 (Wiles, improved by Faltings ) Let (f) : R — T 
be a surjection in Cq with T finite and free over O. For any 
r > 0 regard R and T as 0[[S\, S r ]\-modules by letting the 
Si act trivially. Let t o n (T) denote (1 + T) pn — 1. 

Suppose there exists an integer r > 0 such that for every n > 1 
there is a commutative diagram of surjective homomorphisms 
of local 0[[Si, ..., S r ]\- algebras 

Rn T n 



that has the following properties 

(i) the induced maps R n j (5i, ..., S r )R n —> R and T n / (Si, ..., S r )T n 
T are isomorphisms; 

(ii) the O -algebras R n can be generated by r elements; 



8. Criteria for ring isomorphisms 


cxix 


(in) the rings T n /(u n (Si), ...,c v n (S r ))T n are finite free O [[Si, S r ]]/(u n (Si), 
algebras. 

Then <f : R — > T is an isomorphism of complete intersection 
rings. 
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Fix a continuous homomorphism p : G q — > GL- 2 (F), where F is 
a finite field of characteristic i > 3. We assume as usual that p is 
absolutely irreducible, semistable (and so in particular^ odd), 
and modular (i.e. there is a cuspidal eigenform / = Fci n Q n 
such that trp(Fr p ) = a p for all but finitely many primes p). 
Letting K = Q(\/=b I), where we pick the positive sign if £ = 1 
(mod 4) and the negative sign otherwise, we shall also want to 
assume that the restriction of p to Gj< is absolutely irreducible. 
In fact our assumptions already imply this if t > 5 but not 
necessarily in the critical case of interest to us, t = 3. 

Fix a finite set £ of primes. We wish to obtain a lift of type 
£ that parametrizes all lifts of type £ associated to cuspidal 
eigenforms. 

Recall that (N) denotes the C-vector space of cusp forms 
of weight 2, level V, and trivial Nebentypus. Let T' = T’(N) 
denote the ring of endomorphisms of this space generated by 
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the Hecke operators T n for n prime to £N. Recall what has been 
shown already. 

Theorem 9.1 Let m be a maximal ideal of T' and suppose 
T'/m has characteristic t. 

(a) There exists a continuous semisimple homomorphism p~ m : 
G q — > GL 2 (fF' /m), unramified outside IN , such that tr p~ m (Fr p ) 
T p and detp~ m (Fr p ) = p for all p flN . 

(b) Suppose p ~ n i is absolutely irreducible. Let Ffi be the m- 
adic completion of T' . There exists a continuous homomor- 
phism p m : G q — > GL 2 (Tf n ), unramified outside £N , such that 
trp m (Fr p ) = T p and detp m (Fr p ) = p for all p f£N . 

Proof: Most of this was established in theorem 4.28, where we 
obtained a free T Qf-module of rank 2 on which G q acts as 
above. The injection T' T yields an injection Tfi r —> T ^zQf- 
Since the traces of Frobenius land in 7^, by Chebotarev’s den- 
sity theorem (that the Frobenius elements generate Gq_s for 
any finite S) the trace of any element lies in 7^. This to- 
gether with the absolute irreducibility of p~ m gives, by a result 
of Carayol, that there is a representation into GL- 2 {Fff) with 
the same trace as our given representation into GL-ifiF <S>z Qe)- 
QED 

Now let N = N(p) be the product of the primes at which p 
fails to be good. Since p is semistable, N(p) is squarefree and 
so N = N(p) if p is good at £, and £N{p) otherwise. 

Theorem 9.2 There exists a unique ring homomorphism a : 
T'{N) — > F such that a(T p ) = trp(Fr p ) for all p f\ £N . Let 
m = kera, a maximal ideal ofT'(N). Then p is isomorphic to 
p m :G Q ^GL 2 (TfN)/m)^GL 2 (F). 
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Proof: This is simply a restatement of the work of Ribet and 
others showing that if p is modular (as we have assumed), then 
it is associated to a cuspidal eigenform / of weight, level, and 
Nebentypus predicted by Serre’s invariants, in this case weight 
2, level N, and trivial Nebentypus. Now define a as the usual 
eigencharacter, T(f) = a(T)f for T £ T'(N). QED 
Next we introduce S. Let N s = N^ip) = Ylp Up , where n p is 
the exponent of p in N if p S, = 2 if p £ ^ £, and = 1 

if p £ S,p = £. In particular, N$ = N. Since N\N-£, there is a 
map r : T f (Njf) — » T'(N). Set ms = r -1 (m), a maximal ideal 
of T'(Nz). 


Theorem 9.3 Let p be a lift of p to a ring A in Cp- The fol- 
lowing are equivalent: 

(a) p is unramified outside C/Vs and there exists a ring homo- 
morphism a : T'(Ny) — >• A such that trp{Fr p ) = a(T p ) for all 
P JtiNv. 

(b) There exists a ring homomorphism a : T'(A^s)m s — > A 
such that p is isomorphic to the representation obtained by com- 
position of p ms with a. 

When these hold, (a) determines a uniquely, a extends a con- 
tinuously, and p is a lift of p of type S. 


Proof: (b) implies (a): Composition of the completion map 
T\Ny) — > T'(fVs)m s with a yields a (whose traces of Frobe- 
nius are as stated since this holds for p ms ). 

(a) implies (b): Since T p maps to trp(Fr p ) (p not in S and 
not dividing £N(p)) either way around, the following diagram 
commutes: 
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T'{N ) F 

It follows that a(ms) lands in the maximal ideal of A, which 
is exactly what we need for a to have a continuous extension 
to a. 

The tricky thing here is to establish that p is of type S. This 
is controlled by the form of N e- Namely, Jo(Njf) has good re- 
duction at all primes not dividing TVs and semistable reduction 
at all primes p such that p 2 /AG- Our form of then implies 
that the reduction at i is semistable and that the reduction at 
any prime ^ £ is the same as that for p. The final piece needed 
is an analogue of Tate’s elliptic curves for semistable abelian va- 
rieties (due to Grothendieck, LNM 288) that translates these 
reduction properties over to the form of the associated local 
Galois representations, just as we did in chapter 5. QED 

Definition 9.4 Call a lift of p satisfying the equivalent condi- 
tions of the previous theorem a modular lift of p of type S. 

Now we are in a position to produce a universal such lift. 
Let To be the subheld of F generated by {trp(g)\g e Gq}, 
which is by Chebotarev the same as the subheld generated by 
the traces of Frobenius. Thus T’{Njf )/ ms = Fq. Set 7s = 
T'(Af s ) m E ®ff(r 0 ) W(F). 

Then Tp, is an object in Cp and the map T\Njf ) ms — > 7^ 
composed with p ms : G q — > GL 2 (T'(N^) mi: ) yields a lift p™ od : 
G q — > GL-ifTp) of p which is modular of type S. 

Then the previous theorem translates to the following (which 
says that pft od is the universal lift of p of type £): 
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Theorem 9.5 Given a modular lift p of p of type S to a ring 
A in Cp, there exists a unique morphism 0 : Tp — » A such that 
pm°d C0m p 0se( i w %th 0 is strictly equivalent to p. 

Remarks: (1) If S C S', then by universality we get a unique 
morphism Tjy — » 7s sending p r f, od to pft od . This is surjective 
since 7s is generated by the traces of the representation. 

(2) Also by universality, there exists 0s : — ► 7s sending 

the universal lift of type S to p™ od . By the same reason as in 
(1), 0s is surjective. 

(3) Since T'(Njf) is reduced (i.e. no nilpotents other than 0) 
and a free Z-module of finite rank (recall this follows since it is 
a subring of EndS 2 {N , Z) by the (/-expansion principle), 7s is 
reduced and a free W (F)-module of finite rank. 

We can even give an explicit construction of 7s. Given p, 
consider the set of newforms / such that pf : G q — > GL 2 {Of) 
is equivalent to a lift of p of type £. This is nonempty since by 
Ribet there is such a cuspidal eigenform giving a lift of type 

0, and so necessarily of type £. Inside II Of consider for each 
prime p not in £ and not dividing £N(p) the element ( a p (f )). 
Then 7s is the W (F)-algebra generated by all such elements. 

Example: Continuing our example where p : G q — > GL2(Z/3) 
is associated to the action on the 3-division points of the elliptic 
curve E denoted 57 B in Cremona’s tables, y 2 + y = x 3 — 2x — 

1. Let J'e denote the associated modular form. We need to 
consider what other newforms of level dividing 57 and weight 
2 produce the same p, which is the same as being congruent 
(mod 3) to fp. As noted in Cremona’s tables, there is precisely 
one other elliptic curve of conductor dividing 57 that produces 
such a newform, namely 57C. Since the forms are congruent 



9. The universal modular lift 


cxxv 


(mod 3) but not (mod 9), we get that 


r 0 ss {( x ,y) e Zl\x = y (mod 3)} a Z 3 [[T]]/(T(T - 3)). 

In particular, this is a complete intersection ring and rjTi = 
3Z 3 . 

If, in a general situation, there are exactly two newforms / 
and g (of allowable weight and level) producing the same p : 
G q — > GL^ifLjP) and they are congruent (mod £ n ) but not 
(mod £ n+1 ), then T 0 = Z £ [[T]]/(T(T - £ n f) yielding r/r 0 = £ n Z t . 
In general, t]t^ measures congruences between the correspond- 
ing newforms, and this is why it is called the congruence ideal. 

Besides the properties obtained so far for 7s (see remark (3) 
above), we need to establish three further properties of 7s. The 
first one we needed in reducing to the minimal case. Let T = 7s 
and V = 7s/, where E' = EU {p}. 

Theorem 9.6 With the choice of c p G O from the previous 
chapter, rjT' = c p t]t- 

Proof: Given p r f od : Gq — > GL2(7s), let Ms denote the corre- 
sponding GQ-module, a free rank 2 7s-module. It comes with 
an alternating pairing (the Weil pairing) Afs x Ms — > O, which 
we shall denote by (x,y) i— >< x,y >s- Let ps = A:er(7s — > O) 
and Ls = A7s[ps] (i.e. the elements of Ms annihilated by ps), 
a free Ts/ps = O-module of rank 2. 

Claim: If {x, y} is a basis for Ls, then r\T =< x,y >s- 

To compare tjt and tjt'i we need to construct a homomorphism 
(3 : Ms' — > Ms. Consider for now the most general case p 
C,p KN{p). Then N s = N s p 2 . Let X = X 0 (N S ), X' = X 0 (X S ,), 
and J, J” be the respective Jacobians. The map on the upper 
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half-plane sending r i— > (r, pr, p 2 r) induces “degeneracy maps” 

X' — > X 3 . By Albanese functoriality, this yields a map J' — > J 3 . 

(In the cases p ^ £,p\N(p) or p = £, p good and ordinary, we 
do the same with just r i— > (r,pr). For p = £,p otherwise, we 
use just ri— >r. ) 

This then induces a homomorphism Ti(J')<S>z e O — > (Tg( J)®z e 
O) 3 . By picking the right map (Ms ) 3 — > Ms, namely (1, — p~ l T p ,p~ 1 )^ 
we induce a map (3 : Ms' — > Ms- Let (3' : Ms — > Ms' be the 
adjoint of (3, i.e. satisfy 

< x,(3y >s=< / 3'x,y >s' 

Wiles computes (3(3' by looking at the effect on cuspforms. He 
finds that (3(3 ' = —p~ 2 (p — 1)(T^ — (p+ l) 2 ), which up to a unit 
is c p . In every other case, the same thing holds. 

Now, assuming that if {x, y} is a basis of Ms, then {(3(x), (3{y)} 
is a basis of M s' (which is a tricky problem in cohomology of 
modular curves to show that (3 is surjective), then we have 

m = (< P'{x)if3\y) >£') = (< x,(3(3'{y) >)s = c p (< x,y >s) = c p r] s 

QED 
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Our aim then is first to show that the homomorphism 3 ^ — > T$ 
is an isomorphism. This turned out to be the hardest part of the 
proof and the part that held up Wiles. Its proof follows from 
the work in Taylor-Wiles that establishes that it is enough to 
show that — » Tq is an isomorphism for “sufficiently many” 
Q. This is then established by carefully picking the “nicest” 
sets Q, for which we can control somewhat the form of and 
Tq, as follows. 

Suppose that F is a finite held of odd characteristic t and that 
p : G q — » GL2(F) is a representation. We shall be interested in 
specifying various auxiliary primes q that will always be such 
that <7 = 1 (mod t), p is unramified at q, and p(Fr q ) has 
distinct eigenvalues all in F. Call such a prime special. We can 
control lifts p of p to rings A in Cf at special primes as follows. 


Lemma 10.1 With the above assumptions, there is a basis of 
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M := A 2 on which p(G q ) acts diagonally. In fact, 


p(Q = 



o 

X2 


with XhX 2 : I q — > A* characters of order a power of £ that 
divides q — 1 . 


Proof: By Hensehs lemma, there exists a basis of M on which 
p{Fr q ) is diagonal, say ^ ^ y j • Let x G I q . Since p is unram- 
ified at q, p(x) G T n (A). Since T n (H) is a pro-£ group whereas 
the 1st ramification subgroup G\ of I q is a pro -q group, q £ 
forces p(G\) = {1}. Thus p q factors through Gq /Gi, whose 
structure we found earlier. In particular, 

p(Fr q )p{x)p(Fr q )~ 1 = p{x) q 

Let p(x) = 1 + (up) so that ajj G m, the maximal ideal of 
A. Let / be the ideal of A generated by a t j for i j. The 
above equation implies that Ayiy Aj 1 = qa^ (mod m )/. Our 
assumption that \ A j yields that G ml for all i j. 

Thus, / = ml, so by Nakayama / = (0). Hence, = 0 for 
i 7 ^ j , i.e. p(x) is diagonal. 

By the equation above again, i s °f finite order dividing 
q — 1. Since Xi = 1 (mod m), these orders are powers of £. 
QED 

Letting A q be the largest quotient of (Z /q)* of order a power 
of £, this is naturally a quotient of I q via the cyclotomic char- 
acter. The last result says that each Xi factors through A q . 

We shall be considering sets Q of primes q satisfying the con- 
ditions above. Letting Aq = U q& Q , choosing one of the Xi f° r 
each q G Q defines a homomorphism Aq — » A*. In particular, 
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if we take A to be the universal type Q deformation ring 3 £q or 
the universal modular type Q deformation ring 7 q, then this 
makes them LF(F)[AQ]-algebras. Letting Iq be the augmenta- 
tion ideal of W(F)[Aq\, our above lemma nicely relates 3 ?q and 
3^0. 

Theorem 10.2 A lift of p of type Q to A in Cp is unramified 
at all q G Q if and only if the kernel of the representing map 
3£q — > A contains Iq^q. 

Thus the natural map 3?q — > 3?0 induces an isomorphism 
^q/^q^q 3^0. 

Proof: First, note that in the above lemma, = Xi 1 since the 
determinant of p lands in both Fi(A) and in Z| and has finite 
order, so is trivial. Thus, one character -call it Xq ~ determines 
the other. A lift is unramified at q if and only if this character 
is trivial. Noting that Iq is generated by the A — 1(A G A q), so 
by the £ — 1(A G A q ,q G Q), we just need to show that = 1 if 
and only if (6 — T)x = 0(x G 3Rq), but this latter is Xq{^) x — x i 
which = 0 if and only if Xq{$) = 1- QED 
We shall write O for W (F) (in fact, sometimes we shall want 
to increase W(F) and this notation includes that possibility). 
If Q consists of r primes, consider the ring homomorphism 

O[[S 1 ,...,S,.]]^O[A 0 ] 

given by 1 + Si i— > a chosen generator of the cyclic group A q .. 
This then induces an isomorphism 

0[[Si, .... S r ]]/((1 + 501^1 - 1, (1 + S,.) |a " - 1) - O[A 0 ], 

under which Iq corresponds to < Si, ..., S r >. Via this map, we 
consider 3 ?q and Tq as 0[[5i, ..., S r ]\ -algebras. 
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Theorem 10.3 If the r primes in Q are special and also satisfy 
that 

Hf(Gq, Ad “Go)*) ^ ®re Q H ] \G Qq , Ad°(p)*)(t) 

is an isomorphism, then Rq is generated by r elements. 

Proof: Rq is a quotient of 0[[Ti, where d is the dimension 
of Hq(Gq, Ad°(p)). So we need an upper bound (of r) on this 
dimension. 

By Wiles’ formula, looking at the qth factor, q £ Q, gives: 

\L q \ _ |g 1 (GQ„Ad°(p))| 

\H«(G Q! ,M°(p))\ H»(G Qt ,Ad°(p))\ 

which = \H 2 (GQ q , Ad°(p))| since the Euler characteristic is 1, 
so by Tate duality = \H°(GQ q , Ad°(p)*)|, i.e. the order of the 
Gc^-invariants of ac/°(p)*- Since p is unramihed at q, this is the 
Fr^-invariants, and an explicit computation gives that this is 

in 

We get such a term for each q £ Q and so overall: 


dimpHQ^GQ, Ad°(p)) = r + dimpHQ^GQ, Ad°(p)*). 

Since Rq/IqRq = 3^0, 3£q is generated by the same number of 
elements as 3^0, which the last argument gives as dimH^GQ, Ad°(p)*). 
We just need to show this is r, but this follows from our hy- 
pothesis and the fact that dimH l (GQ q , Ad°(p)*) = 1. QED 
Moving towards the hypotheses of theorem 8.13, we want for 
each neZ and £ Hq(Gq, Ad°(p)*) a prime q depending on 
ijj such that 
(1) q = 1 (mod £) n 
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(2) q special 

(3) res q (ip) ^ 0 (to ensure f holds). 

Recall Chebotarev’s theorem. This states that if K/Q is a 
finite Galois extension and g £ Gal{Kf Q), then there exist in- 
finitely many primes q unramified in K/Q such that Fr q = g 
(at least up to conjugation). The above conditions then trans- 
late into finding o £ G q satisfying 

(1) a £ Gq(^) = ker(GQ — » Gal{ Q(O n )/Q)) y i a the isomor- 
phism Gal( Q(O n )/Q)) (Z /£ n )* 

(2) Ad° (p)(cr) has an eigenvalue ^ 1 

(3) V’W kip~ l)Ad°(p)* 

where Ad° is the homomorphism GL2(F) — >• Aut^M^iF)) = 
GLs(F) given by conjugation in M 2 (F). 

Exercise: Show that if p(<r) has eigenvalues A and /i, then 
Ad°(p)(a) has eigenvalues 1, A/g, ju/A. Deduce that the eigen- 
values of p(<r) are distinct if and only if Ad° (p)(o~) does not 
have 1 as an eigenvalue. The equivalence of both (2)’s above 
follows if we ensure F is large enough to contain all eigenvalues 
A, g. 

Since the scalar matrices act trivially by conjugation, Ad° 
factors through PGL 2 (F) and so the image of Ad°(p) restricted 
to can be considered as a finite subgroup of PGL-2{F ) 

and so of PGI^FV). All such subgroups are explicitly known, 
namely up to conjugation a subgroup of the upper triangular 
matrices, PST^F^n), PGL2( F^»), D n , A*, £4, A5. The idea is to 
show that in each case 

H\Gal(I</Q), Ad° (p)*) = 0 

either by direct calculation(Cline-Parshall-Scott) or by using 
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that Gal(K/Q ) has order prime to t (Schur-Zassenhaus). 
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Putting it together - the final trick 


Theorem 11.1 Fermat's Last Theorem holds, i.e. if a,b,c,n 
are integers such that a n + b n + c rl = 0 and n > 3, then abc = 0. 

More generally, the argument below establishes: 

Theorem 11.2 Every semistable elliptic curve over Q is mod- 
ular. 

Let E be the corresponding Frey elliptic curve. Consider its 
associated 3-division representation p : G q — > GT^F 3 ). There 
are two possibilities - either p is irreducible or reducible. Sup- 
pose first it is irreducible. 

Lemma 11.3 In this case, p restricted to G^yzj}) is absolutely 
irreducible. 

Proof: Let H be the image of p in PGL 2 (F^) = S,\. Suppose 
the lemma is false. Then H ^ S 4 else p would be surjective by 
the exercise that follows, and then the image of Gq^y^^ would 
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be SL 2 (F 3 ), which is absolutely irreducible. 

We consider the subgroups of £4 in turn, first noting that 
H C A 4 = PSL 2 (F 3 ) is impossible since then det(p) would 
be trivial. Also, H cannot be a subgroup of any S 3 since then 
the image of p is in a Borel subgroup and so p is reducible, 
contradicting our hypothesis. 

This leaves the only possibilities that H is dihedral of order 
8 or a subgroup of index 2 of it. Since E is semistable at every 
prime p 3, p(I p ) has order 1 or 3. Since 3 J[\H\, p(I p ) has 
order 1, i.e. p can only be ramified at the prime 3. Thus the 
abelianization H / H' is the Galois group of an abelian extension 
of Q of degree 4 ramified only at 3. By Kronecker- Weber, this 
has to be a subextension of some Q(£ 3 r), but this has degree 
over Q not divisible by 4. QED 

The hypotheses of the general case are now satisfied; namely 
we have a representation p : G q — > GT^F^) (£ = 3 in fact) 
such that 

(i) p restricted to Gk, K = Q(^ (— l)b _1 )/ 2 Z) is absolutely 
irreducible; 

(ii) p is semistable (follows from theory of Tate curves); 

(iii) p is modular (follows from Langlands-Tunnell) . 

Let E denote the set of primes that divide abc. Then P 3 00 : 
G q — > GL 2(Z 3 ) is a lift of p of type E. We showed in chapter 
11 that Us — > 7s is an isomorphism. This means that the 
universal map Us — 1 > Z 3 producing p 3 00 actually factors through 
7s, whence p 3 ~ is modular, i.e. there exists a cuspidal eigenform 
/ such that trp 3 °°(Fr p ) = a p (f) for all but finitely many primes 

p. 

Next consider p n 00 : G q — > GL 2(Z n ), where n is the Fermat 
exponent. Since trp n oo(F?’ p ) = trp 3 ^{Fr v ) = a p (f ) for all but 
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finitely many primes p, it follows that p n oo is modular (note 
Fadings’ theorem is not needed) and so p n is modular too. 
We showed, however, that p n is good at all primes 2 and 
semistable at 2, whence N(p n ) = 2. By Ribet, p n is then as- 
sociated to a cuspidal eigenform of level 2 and weight 2 and 
trivial Nebentypus, but there are none such. This contradic- 
tion concludes the proof except for the matter of dealing with 
the possibility of p being reducible. 

This possibility also troubled Wiles. Unable to handle it, he 
had the rest of his work typed up in the spring of 1993 (with the 
amusing typo “Fermat’s lost theorem”), when he spotted how 
an idea of Mazur would fix the problem. This is called the “3-5 
switch” . If Wiles had not spotted this, then Elkies’ alternative 
approach [?] of twisting the curves would have finished off FLT. 

Theorem 11.4 Suppose p^ : G q — > GL 2 (F^) is reducible. Then 
Pb '■ G q — » GL- 2 (F^) is irreducible. 

Proof: Suppose both are reducible. Then E( Q) has a GQ-stable 
subgroup of order 15. Under the 1-1 correspondence between 
Yq(N)(K) and isomorphism classes of elliptic curves over a field 
K together with a subgroup of order N defined over K, this el- 
liptic curve corresponds to a point of Yo(15)(Q). Now, Xq(15) 
has genus 1 and so is an elliptic curve (in fact 15 A in Cre- 
mona’s tables). Its rational points form a group of order 8. Of 
these 4 are cusps so not in Fq( 15)(Q), whereas the other 4 cor- 
respond to elliptic curves of conductor 50, which are therefore 
not semistable. QED 

A similar calculation to the first lemma above shows that p§ 
restricted to is absolutely irreducible. p$ is also semistable 

since E is. The one thing missing is that we do not know that 
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it is modular. Since GZ^Fs) is nonsolvable, the Langlands- 
Tunnell approach fails and we take an alternative route. 

Theorem 11.5 (“3-5 switch'’) There exists a semistable ellip- 
tic curve A over Q such that A[ 5] = E[ 5] as G^-modules and 
A[ 3] is irreducible as a G^-module. 

Once this is proven, then by the theorem proven so far, A is 
modular, so its ps is modular. But this is the same ps as for E. 
Thus the missing piece is filled in. 

Proof: The idea is to consider the collection of all elliptic curves 
A such that A[ 5] = E[ 5] as GQ-modules by an isomorphism tt 
that makes the diagram of Weil pairings commute: 

A[b] x A[b] ^ X7r j jE[5] x E[ 5] 





and note that this is in 1-1 correspondence with a curve Y' 
which is a twist of T(5), so of genus 0. Then we use Hilbert’s 
irreducibility theorem. QED 
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